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Abstract 

As the LHC prepares to start taking data, this review is intended to provide a 
QCD theorist's understanding and views on jet finding at hadron colhders, including 
recent developments. My hope is that it will serve both as a primer for the newcomer 
to jets and as a quick reference for those with some experience of the subject. It is 
devoted to the questions of how one defines jets, how jets relate to partons, and to 
the emerging subject of how best to use jets at the LHC. 
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1 Introduction 



It is common to discuss high-energy phenomena involving quantum chromodynamics (QCD) 
in terms of quarks and gluons. Yet quarks and gluons are never visible in their own right. 
Almost immediately after being produced, a quark or gluon fragments and hadronises, 
leading to a collimated spray of energetic hadrons — a jet. Jets are obvious structures 
when one looks at an event display, and by measuring their energy and direction one can get 
close to the idea of the original "parton" . The concept of a parton is, however, ambiguous: 
the fact that partons have divergent branching probabilities in perturbative QCD means 
that one must introduce a prescription for defining what exactly one means by the term. 
Similarly, jets also need to be defined — this is generally done through a jet definition, 
a set of rules for how to group particles into jets and how to assign a momentum to the 
resulting jet. A good jet definition can be applied to experimental measurements, to the 
output of parton-showering Monte Carlos and to partonic calculations, and the resulting 
jets provide a common representation of all these different kinds of events. 

Jets are used for a wide range of physics analyses. One way of classifying their uses 
is according to the different possible origins for the partons that give rise to the jets. At 
hadron colliders (and in photoproduction), one of the best studied jet observables is the 
inclusive jet spectrum, related to the high-momentum-transfer 2 — )■ 2 scattering of partons 
inside the colliding (anti)protons. In this kind of process the energy of the jet (in the 
partonic centre-of-mass frame) is closely related to that of the parton in the proton that 
underwent a hard scattering and the inclusive jet spectrum therefore contains information 
on the distributions of partons inside the proton (e.g. refs. [H El |3l H]), and also on the 
strength of their interaction. 

Another origin for the partons that lead to jets is that they come from the hadronic 
decay of a heavy particle, for example a top quark, a Higgs boson, or some other yet-to-be 
discovered resonance. If, at tree-level, the heavy particle decays to many partons (e.g. 
through a cascade of decays) then a high multiphcity of corresponding jets may be a sign 
of the presence of that particle (as used for example in SUSY searches, such as ref. ^]); and 
the sum of the momenta of the jets (or of the jets, leptons and missing transverse energy 
information) should have an invariant mass that is close to that of the heavy particle, a 
feature used for example in measurements of the top-quark mass [HI E]. 

Jets may also originate radiatively, for example from the emission of a gluon off some 
other parton in the event. The rate of production of such jets provides information on 
the value of strong coupling (for example refs. [SI [H [TOl [H] 112]) and is related also to the 
colour structure of events. One use of this is, for example, to help discriminate between 
Higgs-boson production through electroweak vector-boson fusion (which radiates less) and 
through gluon-fusion (which radiates more). Radiative emission of partons is also one of 
the main backgrounds to multi-jet signals of new physics; consistently predicting such back- 
grounds involves matching tree-level matrix-element calculations with Monte Carlo parton 
showers, for which jet-definitions provide a powerful way of avoiding double counting, by 
prescribing which emissions should be the responsibility of the matrix-element (those that 
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lead to extra jets) and which ones should come from the parton showers (those that don't) 

Though most uses of jets essentially identify a jet as coming from a single parton, one 
should never forget how ambiguous this association really is, and not just because partons 
are an ill-defined concept. For example, when a highly boosted W or Z boson decays to 
two partons, those partons may be so collimated by the boost that they will lead to a 
single jet, albeit it one with substructure. And QCD radiative corrections also inevitably 
give substructure to jets. Much as the number of jets and their kinematics can be used to 
learn about the properties of the event, so can the structure within the jets. 

Given the variety of these and other related possible uses of jets, it should not be 
surprising that there is no single optimal way of defining jets and, over the 30 years that 
have passed since the first detailed proposal for measuring jets [15], many jet definitions 
have been developed and used. The ideas behind jet definitions are rather varied. One of 
the aims of this review (section |2]) is to provide an overview of the different kinds of jet 
definition that exist. Given that the main use of jets in the coming years will be at the 
Large Hadron Collider at CERN (LHC), the emphasis here and throughout the review will 
be on hadron-collider jets, though a number of the ideas in jet finding actually have their 
origins in studies of e^e~ and ep collisions. 

One of the characteristics of the LHC is that its particle multiplicity is expected to be 
much higher than in preceding colliders. Some part of the increase is due to the LHC's 
higher energy, but most of it will be a consequence of the multiple minimum-bias inter- 
actions (pileup) that will occur in each bunch crossing. High multiplicities pose practical 
challenges for the computer codes that carry out jet finding, because the computing time 
that is required usually scales as some power of the multiplicity, A^. Until a few years ago, 
this was often a limiting factor in experimental choices of jet finding methodology. Recent 
years' work (described in section [3]) has shown how these practical issues can be resolved 
by exploiting their relation to problems in computational geometry. This makes it easier 
for LHC's jet-finding choices to be based on physics considerations rather than practical 
ones. 

Given a set of practical jet algorithms, the next question is to establish their similarities 
and differences. Any jet algorithm will form a jet from a single hard isolated particle. 
However, different jet definitions may do different things when two hard particles are close 
by, when a parton radiates a soft gluon, or when the jet is immersed in noise from pileup. 
Section H] examines standard and recent results on these issues, for the most important of 
the current jet algorithms. 

Once one has understood how jets behave, the final question that needs to be addressed 
is that of determining the jet definitions and methods that are optimal for specific physics 
analysis tasks. One might call this subject "jetography" , in analogy with photography, 
where an understanding of optics, of one's light sensor, and of properties of the subject 
help guide the choice of focus, aperture and length of exposure. Ultimately, it is neither 
the photons in photography, nor the jets in jetography that are of interest; rather it is 
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the objects (new particles, PDFs, etc.) that they help you visualise or discover. In the 
context of the LHC, it is probably fair to say that jetography is still in its infancy, hence 
the title of the review. Nevertheless some first results have emerged in the past couple of 
years, notably (as discussed in section |5]) with respect to simple dijet mass reconstructions, 
hadronic decays of boosted heavy particles, and the question of limiting the effect of pileup. 

One thing that this review does not do is examine the wide range of uses of jets in LHC 
and other experiments' analyses, aside from the brief discussion given above. This is a vast 
subject, and to obtain a full overview probably requires that one consult the main ATLAS 
and CMS physics analysis programme documents [Ml HZ] and the "LHC primer" [12] , as 
well as recent work by the Tevatron and HERA, summarised for example in [T9| 120] . Other 
reviews of jets in recent years include [2T| 122]. Finally a topic that is barely touched upon 
here is the nascent field of jet finding in heavy-ion collisions, for which the reader is referred 
to [231121]. 

2 Jet algorithms 

Jet algorithms provide a set of rules for grouping particles into jets. They usually involve 
one or more parameters that indicate how close two particles must be for them to belong to 
the same jet. Additionally they are always associated with a recombination scheme, which 
indicates what momentum to assign to the combination of two particles (the simplest is the 
4- vector sum). Taken together, a jet algorithm with its parameters and a recombination 
scheme form a "jet definition" . 

An accord as to some general properties of jet definitions, the "Snowmass accord" , was 
set out in 1990 [23] by a group of influential theorists and experimenters, and reads as 
follows 

Several important properties that should be met by a jet definition are [3]: 

1. Simple to implement in an experimental analysis; 

2. Simple to implement in the theoretical calculation; 

3. Defined at any order of perturbation theory; 

4. Yields finite cross sections at any order of perturbation theory; 

5. Yields a cross section that is relatively insensitive to hadronisation. 

where ref. [3] is given below as [25] • It is revelatory that ref. [23] is entitled Toward a 
standardization of jet definitions" (my italics). If one reads the rest of the article, one 
realises that it wasn't evident at the time what the standard jet definition should actually 
be, nor was there a clear path towards satisfying the Snowmass accords, at least for hadron 
colliders. 

When the next major community-wide discussion on jets took place, in 2000, in prepara- 
tion for Run II of the Tevatron [2T], new jet algorithms had been invented [27 t l2H t 12 ^ [5CT t [5T] . 



6 



old algorithms had been patched [32] and it is probably fair to say that the community 
had almost satisfied the Snowmass requirements. Nevertheless, the recommendations of 
the Run II workshop were followed in only part of subsequent Tevatron work and, until 
recently, had also been ignored in much of the preparatory work towards LHC 

This means that there are currently very many hadron-collider jet algorithms in use — 
some dating from the 80's, others from the 90's. The situation is further confused by the 
fact that different algorithms share the same name (notably "iterative cone"), and that 
there is no single source of information on all the different algorithms. Additionally, it has 
not always been clear how any given algorithm fared on the Snowmass requirements. 

The purpose of this section is to give an overview of all the main different algorithms, 
including some of the most recently developed ones, so as to provide the background for 
anyone reading current jet work from both the theory and experimental communities. 

The section's organisation reflects the split of jet algorithms into two broad categories. 
Firstly those based in one form or another on "cones". They can be thought of as "top- 
down" algorithms, relying on the idea that QCD branching and hadronisation leaves the 
bulk features of an event's energy flow unchanged (speciflcally, energy flow into a cone). 
Secondly, sequential recombination algorithms, "bottom-up" algorithms that repeatedly 
recombine the closest pair of particles according to some distance measure, usually related 
to the divergent structure of QCD matrix elements. 

The nomenclature used to distinguish the types of jet algorithm (notably cones) is 
currently not always uniform across the fleld. That used here follows the lines set out 
in [331 E]. 

Before continuing, a note is due concerning the completeness of this section. Its aim is 
to communicate the essential ideas about many of the main jet algorithms (a more concise 
overview is given in [31]). It will not describe every detail of every single jet algorithm. 
Where possible, references will be supphed to more complete descriptions. In some cases, 
no such reference exists, and the interested reader is then advised to consult computer code 
for the given jet algorithm. 

2.1 Cone algorithms 

The flrst-ever jet algorithm was developed by Sterman and Weinberg in the 1970's |15] . 
It was intended for e~^e~ collisions and classifled an event as having two jets if at least a 
fraction 1 — e of the event's energy was contained in two cones of opening half-angle 6 (and 
hence is known as a "cone" algorithm). This deflnition made it possible to have a fully 
consistent perturbative QCD calculation of the probability of having two jets in an event. 

The two parameters 6 and e reflect the arbitrariness in deciding whether an event has 
two or more jets. Typically one would avoid taking extreme values (e too close to or 1, 5 
too close to zero), but apart from that the optimal choice of 6 and e would depend on the 
specific physics analysis being carried out. The presence of separate angular and energy 
parameters to dictate the characteristics of the jet finding is typical of cone algorithms, as 
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we shall see below. 

Cone algorithms have evolved substantially since [15] and are today mostly used at 
hadron colliders. The changes reflect the fact that in hadron collisions it doesn't make 
sense to discuss the total energy (since most of it is not involved in the hard reaction, 
and goes down the beam pipe), that it isn't always obvious, physically or computationally, 
where to place the cones, and that issues arise when trying to define events with more than 
two jets (with the associated problem of "overlapping" cones). 

2.1.1 Iteration 

Let us first examine the question of where to place the cones. Most of today's widely used 
cone algorithms are "iterative cones" (IC). In such algorithms, a seed particle i sets some 
initial direction, and one sums the momenta of all particles j within a circle ( "cone" ) of 
radius R around i in azimuthal angle and rapidity y (or pseudorapidity 1])]^ i.e. taking 
all j such that 



where yi and 0i are respectively the rapidity and azimuth of particle i. The direction of the 
resulting sum is then used as a new seed direction, and one iterates the procedure until the 
direction of the resulting cone is stable. The dimensionless parameter R here, known as the 
jet radius, replaces the angular scale 6 that was present in the original Sterman- Weinberg 
proposal. The Sterman- Weinberg e parameter is less-directly mirrored in hadron-collider 
cone algorithms. Rather, most physics analyses will use a cone algorithm to obtain jets 
without any specific energy cut, but then will consider only those jets that are above a 
certain transverse-momentum threshold. 

To be fully specified, seeded iterative jet algorithms must deal with two issues: 

• What should one take as the seeds? 

• What should one one do when the cones obtained by iterating two distinct seeds 
"overlap" (i.e. share particles)? 

Different approaches to these issues lead to two broad classes of cone algorithm. 

2.1.2 Overlapping cones: the progressive removal approach 

One approach is to take as one's first seed the particle (or calorimeter tower) with the 
largest transverse momentum. Once one has found the corresponding stable cone, one 

^These are standard hadron-collider variables. Given a beam along the z-direction, a particle with 
longitudinal momentum pz, energy E and angle 6 with respect to the beam (longitudinal) direction has 
rapidity y = i In and pseudorapidity ij = — In tan 0/2. Massless particles have y = rj. Differences in 

rapidity are invariant under longitudinal boosts, whereas differences in pseudorapidity are invariant only 
for massless particles. Where an analysis in e~^e~ will use particles' energies and the angles between the 
particles, an analysis in a pp collider will often use pt (or Et) and Ai?^^- (defined either with rapidities or 
pseudorapidities) . 
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calls it a jet and removes from the event all particles contained in that jet. One then 
takes as a new seed the hardest particle/tower among those that remain, and uses that to 
find the next jet, repeating the procedure until no particles are left (above some optional 
threshold). This avoids any issue of overlapping cones. A possible name for such algorithms 
is iterative cone with progressive removal (IC-PR) of particles. 

IC-PR algorithms' use of the hardest particle in an event gives them the drawback 
that they are collinear unsafe: the splitting of the hardest particle (say pi) into a nearly 
collinear pair {pia, pu) can have the consequence that another, less hard particle, p2, 
pointing in a different direction and with pt,ia, Pt,ib < Pt,2 < Pt,i, suddenly becomes the 
hardest particle in the event, thus leading to a different final set of jets. We will return to 
this in section 12.1.41 

Fixed cones. A widespread, simpler variant of IC-PR cone algorithms is one that does 
not iterate the cone direction, but rather identifies a fixed cone (FCjl around the seed 
direction and calls that a jet. It starts from the hardest seed and progressively removes 
particles as the jets are identified (thus FC-PR). It suffers from the same collinear unsafety 
issue as the IC-PR algorithms. 

IC-PR and FC-PR algorithms are often referred to as UAl-type cone algorithms, even 
though the algorithm described in the original UAl reference [35] is somewhat different H 
This may be due to different versions of the UAl algorithm having been presented at 
conferences prior to its final publication [53] Fl 

2.1.3 Overlapping cones: the split— merge approach 

Another approach to the issue of the same particle appearing in many cones applies if one 
chooses, as a first stage, to find all the stable cones obtained by iterating from all particles 
or calorimeter towers (or those for example above some seed threshold ~ l-2GeV)0 One 
may then run a split-merge (SM) procedure, which merges a pair of cones if more than a 
fraction / of the softer cone's transverse momentum is in particles shared with the harder 
cone; otherwise the shared particles are assigned to the cone to which they are closer. A 
possible generic name for such algorithms is IC-SM. The exact behaviour of SM procedures 

^ "Fixed cone" can be an ambiguous term. In particular, in some contexts it is used to refer to cones 
whose shape is fixed rather than cones whose position is fixed. 

^ The UAl algorithm [3S] proceeds as follows: the particle (or cell) with highest Et starts a jet; working 
through the list of particles in decreasing Et, each one is added to the jet to which it is closest, as long 
as it is within AR < R (Ai?^ = iS.rf' + A0^, R taken to be 1); otherwise, the particle initiates a new jet. 
Finally, once all remaining particles have Et < 2.5 GeV, each particle is simply added to the jet nearest 
in r], (j) if its transverse momentum relative to the jet axis is less than 1 GeV and it is no further than 45° 
in direction from the jet axis. 

"^I am grateful to Torbjorn Sjostrand for comments on this point. 

^In one variant, CDF's JetClu |35], "ratcheting" is included, which means that during iteration of cone, 
all particles included in previous iterations are retained even if they are no longer within the geometrical 
cone, see also section [^.1.61 
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depends on the precise ordering of split and merge steps and a fairly widespread procedure 
is described in detail in [2T]. It essentially works as follows, acting on an initial list of 
"protojets", which is just the full list of stable cones: 

1. Take the protojet with the largest pt (the 'hardest' protojet), label it a. 

2. Find the next hardest protojet that shares particles with the a (i.e. overlaps), label 
it b. If no such protojet exists, then remove a from the list of protojets and add it to 
the list of final jets. 

3. Determine the total pt of the particles shared between the two protojets, pt,shared- 

• If Pt, shared /Pt,b > f, whcrc / is a free parameter known as the overlap threshold, 
replace protojets a and b with a single merged protojet. 

• Otherwise "split" the protojets, for example assigning the shared particles just 
to the protojet whose axis is closer (in angle). 

4. Then repeat from step 1 as long as there are protojets left. 

Generally the overlap threshold / is chosen to be 0.5 or 0.75 (the latter is probably to be 
preferred |37]). An alternative to SM is to have a "split-drop" (SD) procedure, where the 
non-shared particles that belong to the softer of two overlapping cones are simply dropped, 
i.e. are left out of jets altogether. The main example of an algorithm with a SD procedure 
is PxCone (described for example in |55]). 

The outcome of split-merge and split-drop procedures depends on the initial set of 
stable cones. One of the main issues with IC-SM and IC-SD algorithms is that the addition 
of a new soft seed particle can lead to new stable cones being found, altering the final set 
of jets. This is infrared unsafety and we will discuss it in detail in the next section. 

2.1.4 Infrared and collinear safety, midpoint cones 

Infrared and collinear (IRC) safety is the property that if one modifies an event by a 
collinear splitting or the addition of a soft emission, the set of hard jets that are found in 
the event should remain unchanged. IRC safety is an important property for a range of 
reasons: 

• A hard parton undergoes many collinear splittings as part of the fragmentation pro- 
cess; and the non-perturbative dynamics also lead to collinear splittings, for example 
in the decay of energetic hadrons. Additionally there is always some emission of soft 
particles in QCD events, both through perturbative and non-perturbative effects. 
Collinear splittings and soft emissions effectively occur randomly and even their av- 
erage properties are hard to predict because of the way they involve non-perturbative 
effects. The motivation for constructing jets is precisely that one wants to establish 
a way of viewing events that is insensitive to all these effects (this is also connected 
with point 5 of the Snowmass conditions). 
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• In fixed-order perturbative QCD calculations, one of the main tools involved in mak- 
ing accurate standard-model predictions at high-energy colliders, soft emissions and 
coUinear splittings are associated with divergent tree-level matrix elements. There 
are also corresponding divergent loop matrix elements that enter with the opposite 
sign. Normally the two sources of divergence should cancel, but for IRC unsafe jet 
algorithms the tree-level splittings may lead to one set of jets, while the loop dia- 
grams may lead to another, breaking the cancellation and leading to infinite cross 
sections in perturbation theory (point 4 of the Snowmass conditions). Below, we 
shall illustrate this point in more detail. 

• Experimental detectors provide some regularisation of any collinear and infrared un- 
safety (because of their finite resolution and non-zero momentum thresholds), but 
the extent to which this happens depends on the particular combination of tracking, 
electromagnetic calorimetry and hadronic calorimetry that is used by the experi- 
ment. This can make it quite difficult to connect experimental results for IRC unsafe 
algorithms to the expectations at hadron-level. 

Cone-type jet algorithms have, historically, been plagued by issues related to IRC safety, 
and a significant amount of the work on them has been directed towards understanding 
and eliminating these problems. Let us therefore examine the question for the two classes 
of algorithm we have seen so far. 

The IC-PR case. IC-PR algorithms suffer from collinear unsafety, as illustrated in fig.[Il 
With a collinear safe jet algorithm, if configuration (a) (with an optional virtual loop also 
drawn in) leads to one jet, then the same configuration with one particle split collinear ly, 
(b), also leads to a single jet. In perturbative QCD, after integrating over loop variables in 
(a) and the splitting angle in (b), both diagrams have infinite weights, but with opposite 
signs, so that the total weight for the 1-jet configuration is finite. 

Diagrams (c) and (d) are similar, but for an IC-PR algorithm. In configuration (c), the 
central particle is hardest and provides the first seed. The stable cone obtained by iterating 
from this seed contains all the particles, and one obtains a single jet. In configuration (d), 
the fact that the central particle has split collinearly means that it is now the leftmost 
particle that is hardest and so provides the first seed. Iteration from that seed leads to a 
jet (jet 1) that does not contain the rightmost particle. That rightmost particle therefore 
remains, provides a new seed, and goes on to form a jet in its own right (for full details, see 
the appendix of |33]). As we have discussed above, it is problematic for the result of the 
jet finding to depend on a collinear splitting. The formal perturbative QCD consequence 
of this here is that the infinities in diagrams (c) and (d) contribute separately to the 1-jet 
and 2-jet cross sections. Thus both the 1-jet and 2-jet cross sections are divergent. 

The IC-SM case. IC-SM (and IC-SD) type algorithms have the drawback that the 
addition of an extra soft particle, acting as a new seed, can cause the iterative process 



11 



Collinear safe jet alg. 

a) b) 



Collinear unsafe jet alg 

c) d) 



jet 1 jet 1 

a^x(-oo) a^x(+oo) 
Infinities cancel 



jet 1 



jet 1 



jet 2 

a^x(-oo) a^x(+oo) 
Infinities do not cancel 



Figure 1: Illustration of collinear safety (left) and collinear unsafety in an IC-PR type algorithm 
(right) together with its implication for perturbative calculations (taken from the appendix of 
[33]). Partons are vertical lines, their height is proportional to their transverse momentum, and 
the horizontal axis indicates rapidity. 





soft divergence 



Figure 2: Configurations illustrating IR unsafety of IC-SM algorithms in events with a W and 
two hard partons. The addition of a soft gluon converts the event from having two jets to just 
one jet. In contrast to fig. [U here the explicit angular structure is shown (rather than pt as a 
function of rapidity) . 



to find a new stable cone. Once passed through the split-merge step this can lead to the 
modification of the final jets, thus making the algorithm infrared unsafe. This is illustrated 
in fig. 121 in an event (a) with just two hard partons (and a W, which balances momentum), 
both partons act as seeds, there are two stable cones and two jets. The same occurs in the 
(negative) infinite loop diagram (b). However, in diagram (c) where an extra soft gluon 
has been emitted, the gluon provides a new seed and causes a new stable cone to be found 
containing both hard partons (as long as they have similar momenta and are separated 
by less than 2R). This stable cone overlaps with the two original ones and the result of 
the split-merge procedure is that only one jet is found. So the number of jets depends 
on the presence or absence of a soft gluon and after integration over the virtual/real soft- 
gluon momentum the two-jet and one-jet cross sections each get non-cancelling infinite 
contributions. This is a serious problem, just like collinear unsafety. A good discussion of 
it was given in [39] . 
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Figure 3: Configuration that is the source of IR unsafety in the midpoint (ICmp-SM) algorithm, 
with the diagram on the right illustrating the extra stable cone that can appear with the addition 
of a new soft seed. Taken from 1401. 



The midpoint "fix" for IC-SM algorithms. A partial solution |32] (described also 
in [39j), which was recommended in [21j, is to additionally search for new stable cones by 
iterating from midpoints between each pair of stable cones found in the initial seeded iter- 
ations (ICmp-SM). This resolves the problem shown in fig. |2]and the resulting "midpoint" 
algorithm has often been presented as a cone algorithm that was free of IR safety issues. 
However, for configurations with three hard particles in a common neighbourhood (rather 
than two for the IC-SM algorithms) the IR safety reappears, as illustrated in fig. [31 

The "midpoint algorithm" has been widely used in Run II of the Tevatron within 
CDF (midpoint cone algorithm) and D0 (Run II Cone algorithm, or improved legacy cone 
algorithm). The two experiments have separate implementations, with slightly different 
treatment of seeds (CDF imposes a threshold, D0 does not), cone iteration (D0 eliminates 
cones below a pt threshold, CDF does not) and the split-merge stage. In practice both 
algorithms incorporate a number of further technical subtleties (for example an upper limit 
on the number of iterations, or split-merge steps) and the best reference is probably the 
actual code (available both within Fast Jet [H] v2.4 and SpartyJet |I2]). 



Impact of IRC unsafety. The impact of infrared and coUinear (IRC) unsafety de- 
pends on the observable in which one is interested. For example for the IC-SM type 
algorithms, the configuration on the right of fig. |2]is a NNLO contribution to the W + jet 
cross section, i.e. a contribution a^a^w x oo. Physically, the infinity gets regularised by 
non-perturbative effects and so is replaced by a factor of order Inpt/A, giving an overall 
contribution a^aEw Inpt/A. Since a<j ~ 1/ ln(pt/A), this can be rewritten as ~ a^aEw, i-e. 
the NNLO diagrams will give a contribution that is as large as the NLO diagrams. Thus 
the perturbative series looks like: 

I 2 I 3 1 Pt , 4 1 2 Pt . 

dsC^EW + Cis^EW + «s«£;vyln— + a^aEw^'^ + ••• , 

LO NLO ' ' ' ^ ' 

NNLO NNNLO 

~ OsOew^ + alaEw + OilaEw + a^a^vy + • • ■ , (2) 

LO NLO NNLO NNNLO 
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Observable 



IR2+1 IR3+1, C0II3+1 



Inclusive jet cross section LO 

W/ZjH + 1-jet cross section LO 

3-jet cross section none 

W/Z/H + 2-jet cross section none 

jet masses in 3-jet and W/Z/H + 2-jet events none 



NLO 
NLO 
LO 
LO 



none 



Table 1: Summary of the last meaningful order for various measurements with jet algorithms 
having different levels of IR and collinear unsafety. Adapted from [40] . 

and it is meaningful to calculate the LO term, but no advantage is to be had by calculating 
terms beyond, because the neglected pieces will always be as large as the NLO term. If 
one instead examines the W + 2-jet cross section then the LO term is a^aEw- The NLO 
term, a^aeiylnpt/A ~ a'^aEw is of the same size, so even the LO prediction makes no 
sense. 

The unsafety of the IC-SM algorithm can be labelled IR2+1: its IR unsafety is manifest 
for configurations with two hard particles in a common neighbourhood plus one soft one. 
The midpoint algorithm is IR3+1, while the IC-PR and FC-PR algorithms are C0II3+1 (the 
collinear unsafety is manifest when there are 3 hard particles in a common neighbourhood, 
of which one splits collinearly) . 

For an algorithm labelled as IR„+i or Coll„+i, the last meaningful order for the W^-l-jet 
or the 2-jet cross section is N^^^LO. The last meaningful order for the W^-l- 2-jet or the 
3-jet cross section is N"~^LO. The situation is summarised for various process in table [TJ 

One way of visualising infrared and collinear unsafety (especially for IR2+1 algorithms) 
is that they lead to an ambiguity in the effective jet radius R — a soft emission or collinear 
splitting affects how far the jet algorithm will reach for particles. For the IR2+1 algorithms 
that ambiguity is of O {R) in the reach (i.e. the jet radius is devoid of meaning). For the 
IR3+1 and C0II3+1 algorithms this analogy is less useful. 

What to do with IRC unsafe measurements. Many IRC-unsafe jet measurements 
exist in the experimental literature jfl Some of these cases are like [33], where the measure- 
ment for the W+n-jet cross section is carried out with JetClu, an IR2+1 unsafe algorithm 
for which no order of perturbation theory is meaningful when n > 1. 

The question then arises of how one can compare NLO theory predictions like | ^ H5 | H6 | 
HTlllH] with the experimental results. One approach, specific to the IC-SM case, is to carry 

^Strictly speaking, many algorithms incorporate a seed threshold, e.g. pt > 1 GeV. This means that 
they are not truly infrared unsafe, in that they don't lead to infrared infinities in perturbative calculations 
(though they are then collinear unsafe if applied to particles rather than to calorimeter towers). However 
a 1 GeV seed threshold fails to remove the large logarithms in eq. Q or to eliminate the non-perturbative 
uncertainties associated with IR unsafety. So the seed threshold does not make these algorithms any better 
than a formally IR unsafe one. 
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Tevatron W + 3jets @ LO (MCFM 5.2) 



out the NLO prediction with two somewhat different jet algorithms (for example SISCone 
and a.nti-kt, both discussed below), and use the difference between the NLO calculations 
with the two algorithms as a measure of the uncertainty in the prediction due to IR safety 
issues. The logic behind this is that SISCone behaves as would an IC-SM algorithm when 
there are soft particles everywhere (combining hard partons into a common jet when they 
are as far as 2R apart in some cases), while anti-Zcj behaves somewhat similarly to an 
IC-SM algorithm when there are no soft particles present (hard partons separated by more 
than R usually do not end up in the same jet). These differences are discussed in more 
detail in section |4]T1 



A comparison of SISCone and anti-fcj was per- 
formed for example in ref. [?8]. It examined the 
+ 3jets cross section at the Tevatron (measured 
with JetClu, R = 0.4 for jets with \y\ < 2 |13]) i-5 
and found that the SISCone prediction was about 
20% smaller than the anti-/cj prediction at LO (the 
difference is reduced at NLO), because in the SIS- 
Cone case there is a higher likelihood that two of 
the three LO partons will be combined into a single 
jet, giving W + 2 jets rather than W -|- 3 jets. This 
may not seem like an enormous effect compared 
to typical experimental systematic uncertainties, 25 30 35 40 45 50 

however one should remember that the size of the [GeV] 
difference depends also on the cuts and the choice 
of R. For example, with a larger R value (e.g. 
R = 0.7) or a smaller rapidity range, the differ- 
ences between the algorithms increase noticeably, 
as illustrated in figure HI 

In the long-run, an alternative approach might 
be to use tools like MC@NLO gO] and POWHEG 
|50] . which may eventually include a range of jet 
processes and thus provide both the NLO terms 



Q. 
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|yje,sl<1,R=0.7 
IVjetsI < 2, R=0.7 
|yjetsl<1.R=0-4 
IVjetsI < 2, R=0.4 



Figure 4: The ratio of the anti-kt and 
SISCone results for the TV + 3 jet cross 
section, shown as a function of the trans- 
verse momentum of the third hardest jet, 
for two different R values and rapidity ac- 
ceptances for the jets, as calculated with 
MCFM [33]. This ratio provides a mea- 
sure of the ambiguity in perturbative pre- 
dictions for an IR unsafe IC-SM jet algo- 
rithm such as JetClu. 
and an acceptable estimate of the large higher- 
order logarithms and the non-perturbative effects (with IRC jet safe algorithms another 
advantage of tools like MC@NLO and POWHEG is that they provide a way of consistently 
including both NLO corrections and non-perturbative hadronisation effects within a single 
calculation) . 



2.1.5 Exact seedless cones 

One full solution to the IRC safety issue avoids the use of seeds and iterations, and instead 
finds all stable cones through some exact procedure. This type of algorithm is often called 
a seedless cone (SC, thus SC-SM with a split-merge procedure). 
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Figure 5: Failure rates for IR safety tests [lOj with various algorithms, including a midpoint 
variant with 3-way midpoints and some seedless algorithms with commonly used, but improper, 
split-merge procedures. See table [21 p. 1301 for the classification of the main different algorithms 
and [40j for a description of the different seedless variants. CDF MidPoint-3 is like the standard 
MidPoint algorithm except that it also uses midpoints between triplets of stable cones. Briefly, 
the IR safety test proceeds as follows: first one generates an event with between 2 and 10 hard 
particles, and applies a jet finder to the event; then one generates some number of random very 
soft particles {pt ~ lO"^''" GeV), and applies the jet finder to the event consisting of soft and 
hard particles. If the hard jets (those with pt ^» 10"^'''^ GeV) are the same in the two cases, then 
the jet finder passes the IR safety test for that event. One repeats the exercise for many events. 
SISCone passed the test for all 4 x 10^ events used. Other algorithms failed the test for some 
fraction of events, as given in the figure. 

In a seedless cone algorithm, the addition of a soft particle may lead to the presence 
of new stable cones, however none of those new cones will involve hard particles (a soft 
particle doesn't affect the stability of a cone involving much larger momenta), and therefore 
the set of hard stable cones is infrared safe. As long as the presence of new soft stable 
cones (or of new soft particles inside hard stable cones) doesn't change the outcome of 
the split-merge procedure (a non-trivial requirement), then a seedless cone will lead to an 
infrared safe collection of hard jets. 

A computational strategy for identifying all cones was outlined in ref. [2Tj: one takes 
all subsets of particles and establishes for each one whether it corresponds to a stable cone 
— i.e. one calculates its total momentum, draws a circle around the resulting axis, and if 
the points contained in the circle are exactly as those in the initial subset, then one has 
found a stable cone. This is guaranteed to find all stable cones. 

The above seedless procedure was intended for fixed-order calculations, with a very 
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limited number of particles. It becomes impractical for larger numbers of particles be- 
cause there are O (2^) possible subsets (think of an A^-bit binary number where each bit 
corresponds to a particle, and the subset consists of all particles whose bit is turned on). 
Testing the stable-cone property takes O (N) time for each subset and so the total time 
is O (iV2^) . This exponential-time behaviour made seedless cones impractical for use on 
events with realistic numbers of particles (the N2'^ approach would take about 10^'' years 
to cluster 100 particles). However in 2007 a polynomial-time geometrically-based solution 
was found to the problem of identifying all stable cones PUJ. The corresponding algorithm 
is known as SISCone and it is described in section 13.21 An explicit test of the IR safety of 
SISCone is shown in fig. [51 

Seedless cone algorithms are also programmed into NLO codes like NLOJET-(--(- [15] 
and MCFM jUj. Users should however be aware that there is some degree of confusion 
in nomenclature — for example the cone algorithm in MCFM v. 5.2 is referred to as the 
midpoint algorithm, but is actually a seedless implementation; in NLOJET++ v. 3 the 
algorithm is referred to as seedless, but has a midpoint option. Users of NLO codes are 
therefore advised to make sure they know exactly what is implemented in the NLO code's 
native jet finder (i.e. they should carefully inspect the portion of code devoted to the jet 
finder). Alternatively they may use appropriately documented 3rd party libraries for their 
jet finding. 

2.1.6 Dark towers 

The xC-SM class of algorithms collectively suffers from a problem known as dark tow- 
ers [51]: regions of hard energy flow that are not clustered into any jet. Dark towers arise 
because there exist configurations in which some particles will never end up in a stable 
cone. The stages of an iteration in which this is the case are shown in fig. E], for which the 
rightmost particle cannot be in a stable cone: even when one uses it as a starting point for 
iteration, it is not contained in the final stable cone (nor is it contained in any stable cone 
in a seedless algorithm). 

One solution to this problem in iterative algorithms is "ratcheting": a particle that 
was included at any stage of the iteration is manually included in all subsequent stages 
of the iteration even if it is not within the cone boundary. This is used in CDF's JetClu 
algorithm (though it is not actually described in the reference usually quoted by CDF for 
JetClu [Si). 

Another fix to dark towers was proposed in [SI] , and referred to as the "searchcone" . It 
eliminates a large fraction of the dark towers by using a smaller radius to find stable cones 
and then expands the cones to their full radius, without further iteration, before passing 
them to the SM procedure. Unfortunately, when applied together with the midpoint 
procedure (ICse,mp-SM) it worsens its IR unsafety status from IR3+1 back to IR2+1 [52]. 

Perhaps the simplest solution [52] to dark towers is to identify any remainder energy 
flow that was not clustered by the xC-SM algorithm and run an extra pass of the algorithm 
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Figure 6: Some of the stages of a stable-cone iteration that leads to dark towers. In the first 
panel, one starts the stable-cone iteration with the rightmost particle (3) as a seed. The cone 
contains particles 2 and 3 and its momentum points roughly midway between them. The direction 
of the momentum is then used as the centre of a cone for the next iteration (second panel). The 
cone in the 2nd panel contains all three particles. The resulting momentum direction provides the 
cone centre in the 3rd panel and now particle 3 is no longer contained in the cone. One further 
iteration leads to the stable cone in panel 4, which does not contain particle 3 even though it 
provided the initial seed direction. For this event, therefore, particle 3 will never appear in any 
stable cone. 



on that remainder. This is the approach used in SISCone (which by default runs multiple 
passes until no energy is left). 

2.2 Sequential recombination jet algorithms 

Sequential recombination algorithms have their roots in e^e~ experiments. A detailed 
overview of their history in e~^e~ studies is given in the introduction of [53]. The intention 
here is not to repeat that history, but rather to walk through the most widely used of the 
e~^e~ algorithms and then see how they lead to corresponding hadron-collider algorithms. 
It should be said that many of the ideas underlying today's sequential recombination 
algorithms (including a momentum- related parameter to decide jet resolution and the 
use of relative transverse momenta) actually appeared first in the LUCLUS algorithm of 
Sjostrand [51] (earlier work includes [551 [SSI EB] ) • However computational constraints 
at the time led to the algorithm including a preclustering phase, and it also involved a 
non-trivial procedure of reassignment of particles between clusters at each recombination. 
These two characteristics made it somewhat more complicated than its successors. 

Today's sequential recombination algorithms are all rather simple to state (far more so 
than the cone algorithms). Additionally they go beyond just finding jets and implicitly as- 
sign a "clustering sequence to an event" , which is often closely connected with approximate 
probabilistic pictures that one may have for parton branching. 



2.2.1 Jade algorithm 

The first simple sequential recombination algorithm was introduced by the JADE collab- 
oration in the middle of the 1980's [591 ISQ]- It is formulated as follows: 
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1. 



For each pair of particles i, j work out the distance 



_ 2E,E,(l-cos%) 

where Q is the total energy in the event Ei is the energy of particle i and % the 
angle between particles i and j. For massless particles, i/ij is the just the (normalised) 
squared invariant mass of the pair. 

2. Find the minimum y^i^ of all the i/ij. 

3. If ?/min is below some jet resolution threshold ycut, then recombine i and j into a single 
new particle (or "pseudojet") and repeat from step 1. 

4. Otherwise, declare all remaining particles to be jets and terminate the iteration. 

The number of jets that one obtains depends on the value of T/cut, and as one reduces T/cut, 
softer and/or more collinear emissions get resolved into jets in their own right. Thus here 
the number of jets is controlled by a single parameter rather than the two parameters 
(energy and angle) of cone algorithms. 

Quite often in e~^e~ analyses one examines the value of ?/cut that marks the transition 
between (say) an event being labelled as having n and n + 1 jets, yn{n+i)- Thus if ?/23 is 
small, the event is two-jet like, while if it large then the event clearly has 3 (or more) jets. 

The JADE algorithm is infrared and collinear safe, because any soft particle will get 
recombined right at the start of the clustering, as do collinear particles. It was widely used 
up to the beginning of the 1990s (and still somewhat beyond then), however the presence 
of EiEj in the distance measure means that two very soft particles moving in opposite 
directions often get recombined into a single particle in the early stages of the clustering, 
which runs counter to the intuitive idea that one has of a jet being restricted in its angular 
reach. As well as being physically disturbing, this leads to very non-trivial structure 
(non-exponentiated double logarithms) in higher-order calculations of the distribution of 
2/23 E21 IB3J (later, this was also discussed in terms of a violation of something called 
recursive infrared and collinear safety ^64j ) . 



2.2.2 The kt algorithm in e+e 



The e~^e kt algorithm [27] is identical to the JADE algorithm except as concerns the 
distance measure, which is 

2min(Ef,E|)(l-cos%) 

= ■ 

In the collinear limit, % ^ 1, the numerator just reduces to {v[vui{Ei, Ej)9ijY which is 
nothing but the squared transverse momentum of i relative to j (if i is the softer particle) 



^In experimental uses, it is often the total visible energy in the event. 
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— this is the origin of the name fct-algorithmj^ The use of the minimal energy ensures 
that the distance between two soft, back-to-back particles is larger than that between a 
soft particle and a hard one that's nearby in angle. 

Another way of thinking about eq. (jl]) is that the distance measure is essentially pro- 
portional to the squared inverse of the splitting probabihty for one parton k to go into two, 
i and j, in the limit where either i or j is soft and they are coUinear to each other, 

^ftlll. ^ (5) 

dEidOij mm{Ei,Ej)eij ^ ' 

There is a certain arbitrariness in this statement, because of the freedom to change variables 
in the measure on the left-hand side of eq. (|5]). However the presence of a power of just the 
minimum of the energy in the denominator (rather than some function of both energies as 
in the JADE distance measure) is robust. 

The kt algorithm's closer relation to the structure of QCD divergences made it possible 
to carry out all-order resummed calculations of the distribution of ?/n(n+i) [23 ESI |66] and of 
the mean number of jets as a function of ?/cut [SZ] • This helped encourage its widespread use 
at LEP. The relation to QCD divergences also means that the clustering sequence retains 
useful approximate information about the sequence of QCD splittings that occurred during 
the showering that led to the jet. This is of interest both in certain theoretical studies 
(for example CKKW matching of parton-showers and matrix elements |13j) and also for 
identifying the origin of a given jet (for example quark versus gluon discrimination |68j). 



2.2.3 The kt algorithm with incoming hadrons 

In experiments with incoming hadrons two issues arise. Firstly (as mentioned already for 
cone algorithms) the total energy is no longer well defined. So instead of the dimensionless 
distance one might choose to use a dimensionful distance 

= 2 mm{El E]){1 - cos %) , (6) 

together with a dimensionful jet-resolution parameter dent (alternatively, one might main- 
tain a dimensionless measure by choosing some convention for the normalisation scale). 
Secondly, the divergences in the QCD branching probability are not just between pairs of 
outgoing particles, but also between an outgoing particle and the incoming beam direction. 

The first attempt at formulating a kt algorithm in such cases was [HS]. It introduced 
the idea of an additional particle-beam distance. 

diB = 2E^{1- cos 9,b), (7) 
^As mentioned above, the distance measured used in the earher LUCLUS algorithm [54], yij = 

I ^ |2 I ^ |2 

2 ^1 J^^i yj'^2Q2 (1 ~ COS 0ij) (in the version given in [53]), was also a relative transverse- momentum type 
variable. 
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which, for small 6iB, is just the squared transverse momentum of particle i with respect 
to the beam. The algorithm then remains the same as in e~^e~ , except that if a diB is the 
smallest, then the particle is recombined with the beam, to form part of the "beam-jet". 
If there are two beams, then one just introduces a measure for each beam. 

In pp collisions it is standard to use variables that are invariant under longitudinal 
boosts, however the dij and diB given above only satisfy this property approximately. 
Thus ref. [28] introduced versions of the distance measures that were exactly longitudinally 
invariant 

dij = min(p2., p^j)AR^j , Ai?J = {y^ - y.f + (0^ - (p.f , (8a) 

diB = Pti , (8b) 

(this variant does not distinguish between the two beam jets)j£| It is straightforward to 
verify that in the relevant collinear limits, these measures just reduce to relative transverse 
momenta, like those in eqs. ( 16|7|) . Furthermore, since {i/i — yj), the 0j and pu are all 
invariant under longitudinal boosts, the dij and diB are too. Nowadays the procedure 
of section B.2.H with the distance measures of eqs. (E]), is referred to as the exclusive kt 
algorithm, in that every particle is assigned either to a beam-jet or to a final-state jet. 



Inclusive kt algorithm. At about the same time that ref. [2S] appeared, a separate 
formulation was proposed in [22], which has almost the same distance measures as eq. ([H]), 



dij = min(p^., P%)^^ , Ai?J = (y^ - y^f + {(pi - (pjf , (9a) 

diB = Pti , (9b) 

where the difference lies in the presence of a new parameter R (also called D) in the dij, 
whose role is similar to i? in a cone algorithm (see below). The other difference in this 
version of the algorithm is in how the dij get used: 

1. Work out all the dij and diB according to eq. ([8]). 

2. Find the minimum of the dij and diB- 

3. If it is a dij, recombine i and j into a single new particle and return to step 1. 

4. Otherwise, if it is a diB, declare z to be a [final-state] jet, and remove it from the list 
of particles. Return to step 1. 

5. Stop when no particles remain. 

^Ref. [5S] also proposes a variant where Ai?fj- = 2(cosh(yi — yj) — cos(0i — ^j)), more closely related 
to the precise structure of the QCD matrix elements; however, to the author's knowledge, it has not seen 
extensive use. 
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Here, all particles are included in final-state jets, there is no concept of a beam jet, and 
there is no dcut parameter — the question of what gets called a jet is determined by R: if 
a particle i has no other particles within a distance R then the dis will be smaller than 
the dij for any j and the particle will then become a jet. One consequence of this is that 
arbitrarily soft particles can becomes jets in their own right and therefore (just as for cone 
algorithms), one should additionally specify a minimum transverse momentum that a jet 
should have for it to be of interest. 

The above algorithm is most unambiguously referred to as the inclusive kt algorithm, 
though when people mention the "fct algorithm" in a collider context, it is nearly always 
the inclusive variant that they have in mind. It so happens that the exclusive and inclu- 
sive variants have identical clustering sequences — it is only the interpretation of those 
clustering sequences that differs. 

The kt algorithm has long been advocated by theorists because it is free of any infrared 
and coUinear safety issues. On the other hand it had been criticised by experimenters on 
the grounds (a) that it was computationally slow, insofar as the two public implementations 
that were available in 2005, KtClus (Fortran) [7D] and KtJet (C++) [7T], both took times 
~ to cluster particles; and (b) that it produces geometrically irregular jets, which 
complicates certain detector and non-perturbative corrections!^ We will return to the 
speed issue in section l3TT| while the irregularity is visible as the jagged boundaries of the 
jets in fig. [71 p. [29] (related issues will be discussed in section [13]). 

Given the number of experimental objections that have been raised in the past regarding 
the kt algorithm in a pp environment, it is worth commenting briefly on the two sets of 
hadron-coUider measurements that have been carried out with the kt algorithm. One, from 
D0 |721[75], had to go to considerable lengths (introducing preclustering) to get around the 
speed issue (D0's fine calorimeter meant that it had many input towers) and found rather 
large non-perturbative corrections from the underlying event (UE); the latter issue perhaps 
discouraged further use of the kt algorithm until CDF performed a similar measurement 
in 2005 [I1[T1]. CDF did not suffer particularly from the speed issue, largely because their 
coarser calorimeter segmentation ensured modest input multiplicities. Also, crucially, they 
showed that D0's large UE corrections were probably a consequence of taking the jet radius 
parameter R = 1. When CDF instead took R = 0.7 (as is common for cone algorithms), 
they found UE corrections that were commensurate with those for cone algorithms. 

It should also be added that the longitudinally invariant kt algorithm was the main jet 
algorithm used at HERA, both in photoproduction (e.g. refs. |3l[75j), the first (pubhshed) 
experimental context in which it was used [76], and deep inelastic scattering (e.g. refs. [77] 
[75]). Compared to Tevatron this was probably facilitated by the lower particle multiplicites 
in DIS and photoproduction and also by the quieter underlying event. 

^''For example, if jets are circular with radius R in the y — 4> plane, then any jet whose momentum points 
at least a distance R away from the edge of the central part of the detector will always be fully contained 
in that central part. If jets can be irregular, with boundaries that sometimes extend beyond a distance 
R from the jet momentum, then there is no such simple way of identifying the region of the detector in 
which all jets will be fully contained. 
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2.2.4 The Cambridge and Aachen algorithms 



The Cambridge algorithm [30j is a sequential recombination algorithm for e^e~ collisions 
that introduces two distance measures between pairs of particles. It has vij = 2(1 — cos6'jj) 
(i.e. the squared angle) as well as the i/ij of eq. ([3]). It reads as follows 

1. If only one particle is left, call it a jet and stop. 

2. Otherwise find the pair of particles with smallest Vij. 

3. If the corresponding i/ij < i/cut, replace i and j with the recombined one and go to 



4. Otherwise: take the less energetic of i and j, remove it from the list of particles, call 
it a jet, and go to step 1. 

The idea here was to combine the i/cut jet resolution of the kt algorithm with a clustering 
sequence dictated by angular ordering, i.e. one that relates closely to the powerful concept 
of angular ordering that arises when considering multiple gluon emission [79j. 




Cambridge/ Aachen. The most widely discussed extension (and simplification) of the 
Cambridge algorithm to hadron colliders was actually originally given in the context of DIS 
studies [31] (another one [80] has seen less study). It is like the inclusive kt algorithm in 
that it uses longitudinally invariant variables, introduces an R parameter, and does away 
with the Hij cut on jets. It procedes by recombining the pair of particles with the smallest 
ARij, and repeating the procedure until all objects are separated by a ARij > R. The 
final objects are then the jetsl"1 

This algorithm was originally named the Aachen algorithm, though it is often now 
called the Cambridge/Aachen (C/A) algorithm, reflecting its angular-ordered Cambridge 
roots. 

Like the kt algorithm, the C/A algorithm gives somewhat irregular jets, and its original 
implementations took a time that scales as N^. The latter problem is now solved (as for 
the kt algorithm) and the fact that the C/A has a clustering hierarchy in angle makes it 
possible to consistently view a specific jet on many different angular scales, a feature whose 
usefulness will become apparent in section 15.31 and is also relevant for a "filtering" method 
discussed below. 

^^Alternatively, one can formulate it like the inclusive kt algorithm, but with dij — ARfJR^ and diB = 1- 



step 1. 
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2.2.5 The anti-fc^ algorithm 



One can generalise the kf and Cambridge/ Aachen distance measures as 

A r2 



rf,, = min(pf!,p^) —f , A4 = {y, - + (0,, - (10a) 



where p is a parameter that is 1 for the kt algorithm, and for C/A. It was observed in 
that if one takes p = —1, dubbed the "anti-fcj" algorithm, then this favours clusterings that 
involve hard particles rather than clusterings that involve soft particles {kt algorithm) or 
energy- independent clusterings (C/A). This ultimately means that the jets grow outwards 
around hard "seeds". However since the algorithm still involves a combination of energy 
and angle in its distance measure, this is a coUinear-safe growth (a collinear branching 
automatically gets clustered right at the beginning of the sequence) 1^ The result is an 
IRC safe algorithm that gives circular hard jets, making it an attractive replacement for 
certain cone- type algorithms (notably IC-PR algorithms). 

One should be aware that, unlike for the kt and C/A algorithms, the substructure clas- 
sification that derives from the clustering-sequence inside an anti-fct jet cannot be usefully 
related to QCD branching (essentially the anti-fc^ recombination sequence will gradually 
expand through a soft subjet, rather than first constructing the soft subjet and then re- 
combining it with the hard subjet). 



2.2.6 Other sequential recombination ideas 

The flexibility inherent in the sequential recombination procedure means that a number of 
variants have been considered in both past and recent work. Some of the main ones are 
listed below. 



Flavour-A;f algorithms. If one is interested in maintaining a meaningful flavour for jets 
(for example in purely partonic studies, or when discussing heavy- flavour jets), then one 
may use a distance measure that takes into account the different divergences for quark and 
gluon branching, as in |811 [82]. The essential idea is to replace eq. @ with 

(^F) _ 2(1 — cos 9ij) j max{Ef, Ef) , softer of i, j is flavoured, , . 

^'■^ Q2 I mm{Ef , Ej) , softer of z,j is flavourless, 

where gluonic (or non-heavy-quark) objects are considered flavourless. This reflects the 
fact that there is no divergence for producing a lone soft quark, and correctly ensures that 
soft quarks are recombined with soft antiquarks. In normal algorithms, in contrast, a soft 
quark and anti-quark may end up in different jets, polluting the flavour of each one. Full 

-"^^If one takes p — > — oo then energy is privileged at the expense of angle and the algorithm then becomes 
collinear unsafe, and somewhat like an IC-PR algorithm. 
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details, and the hadron collider variants, are given in [SI], while an application to 6-jets 
was given [H2], where it led to a much more accurate NLO prediction for the inclusive 6-jet 
spectrum. Related ideas have also been used in a sequential-recombination jet algorithm 
designed for combining QCD matrix elements and parton showers ^83j . 

Variable-i? algorithms. A recent proposal in [84j suggests a class of hadron-coUider 
distance measures of the following form 

dij = mm{p^f,Ptf)AR^j , dis = pffResiPu) , (12) 

where the radius of the jet (now placed in the diB term rather than dij) becomes a function 
of the jet's transverse momentum i?eff(Ptj)- This provides an original way of having a 
jet radius that depends on the event structure, a feature which in general can be useful 
(cf. section 15. ip . In |8l] it was applied specifically to the question of dijet resonance 
reconstruction, with the aim of producing larger jets, Rcs ~ ^/Pt, (appropriate with p <0) 
for resonances that decay along the beam direction, and it led to improved resolution on 
the reconstructed mass peak. 

Filtering, pruning and trimming. As we shall see in section [5l contamination from 
non-perturbative effects associated with beam-remnants (underlying event) in hadron col- 
liders is a major cause of degradation of resolution on jets' energies. One way of reducing 
this [85] is to first find the jets (with some given R) and then reconsider each jet on a 
smaller angular scale, Rmt < R (either by reclustering, or by making use of the hierarchical 
angular information in the C/A algorithm). On that smaller angular scale one then takes 
(say) the two hardest subjets, corresponding physically to a hard parton and its hardest 
gluon emission, while rejecting the junk that comes from the underlying event. A variant of 
this, referred to as "trimming" in [SS], is to retain all subjets above some threshold in trans- 
verse momentum. Initial studies [SSI EH ES] indicate that these can provide non-negligible 
advantages in kinematic reconstructions. 

A related idea, "pruning," was suggested in ref. [SHI IBS]- During the (re) clustering 
of the jet, if two objects i,j are separated by Ai?,^ > R^it and the softer one has z = 
m.m{pti,ptj) < Pt,i+j < Zcut (with Zcut = 0.1 say), then that softer one is simply discarded. 

One issue with filtering, pruning and the variable- approach discussed above, is that 
they all introduce extra degrees of freedom in the jet finding. Thus the gains that they 
may provide come at the expense of having to tune those choices to the specific physics 
analysis that is being carried out. 

3 — )■ 2 recombination. Most sequential recombination algorithms are related to the 
idea of inverting successive 1 — )• 2 perturbative branchings (as used in many parton-shower 
Monte Carlo programs). When simulating QCD branching it can also be useful to consider 
"dipole" branchings, i.e. 2 — i- 3 splittings, as in Ariadne [90]. Correspondingly one can 
imagine a sequential-recombination jet algorithm that inverts these branchings by carrying 
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out 3—7-2 clusterings. This is the principle of the ARCLUS algorithm for e~^e~ 
collisions. In practice its performance is similar to that of other e~^e~ algorithms (as 
discussed in [53J). 

2.3 Jet finding as a minimisation problem 

Several groups have considered jet finding as a minimisation problem. Though not the 
main subject of this review, for completeness it is worth devoting a few lines to describe 
these ideas, which fit into the top-down approach to jet finding, and have been explored 
by several groups over the past decade. 

One approach [92] relates to a method known as fc-means in the more general computer 
science field of clustering |93]. It introduces a partition of particles i into n clusters Lk 
{k = 1 . . .n, with n chosen a priori). For a given partition, each cluster has a centroid Ck 
and one can evaluate a measure 

k ieLk 

where d{pi, Ck) is some measure of the distance between particle i and the centroid k. 
One then chooses the assignment of particles into clusters that minimises S. Part of the 
motivation given for the approach of [92] is that it allows one to also include a range of 
physical constraints (such as the l^-mass in top reconstruction) when carrying out the 
minimisation. However there are open questions as to how it may fare in analyses where 
one doesn't actually know what the number of jets should be (for example because of 
background contamination). 

Two other approaches, "deterministic annealing" (DA) [9lj and the "optimal jet finder" 
(OJF) [95] do away with the idea that a particle belongs to any single jet. Essentially (and 
in a language closer to [95]), they argue that each particle i is associated with jet k with a 
weight Wik such that Wik = 1 (or alternatively also allowing for particles to be associated 
with no jet [HS]). The momentum of jet k is then given by = Yli'^ikPi- the OJF 
approach, one makes an a-priori choice for the number of jets, and then a minimisation is 
carried out over all the entries of the Wik matrix, so as to find the lowest value of some 
cost function, corresponding for example to some combination of the jet masses; one can 
then repeat the minimisation for a different number of jets and introduce some criterion 
for one's choice of the number n of jets, based on the value of the cost function for each n. 

In the DA approach, roughly speaking, given some initial weights, one calculates the 
jet momenta P^, and then one recalculates the weights according to 

where /3 is an inverse temperature and d{pi, Pk) is some distance measure (for example 
AP|^). One iterates until the weights converge. This is accompanied by the observation 
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that for /3 = 0, whatever the starting conditions, the Wik will be independent of i, which 
implies that whatever the initial conditions and value of n, all jets will have identical 
directions (i.e. there is only one jet); as one increases /3, the system will then tend to 
develop a larger number of distinct jets. Thus (3 plays the role of l/rfcut in sequential 
recombination algorithms. 

Finally, just as this review was about to be made public, it was brought to the author's 
attention that a code FFTJet had just been released [96] which is a further approach 
involving minimisation (using fast fourier transforms) as well as weighted assignment of 
individual particles to multiple jets (the method is discussed in detail in ref. [57]). 

The ideas behind the OJF, DA and FFTJet algorithms are certainly interesting, espe- 
cially the concept that a particle may be associated with more than one jet, though it is 
perhaps not obvious that the extra conceptual complexity that stems from this is offset by 
any particular benefits. In the corresponding initial studies of OJF and DA |95l[9l] physics 
performances were found to be comparable to that of the kt algorithm, though a practical 
advantage at the time that OJF and DA were proposed (no longer relevant nowadays) 
was a better scaling of the computational speed with particle multiplicity, ~ N for OJF, 
~ A^^ for DA. The study performed with FFTJet in ref. [HZ] suggests that it might be 
more resilient than other algorithms with respect to the effects of magnetic fields, however 
the study was lacking a number of important physical effects such as the underlying event, 
which might well affect the conclusions. A further point is that FFTJet's timing scales not 
as the number of particles in the event but as kink where k is the number of cells used 
in the fast-fourier transform procedure used for the minimisation. This is an advantage in 
very busy events, but can be a drawback for parton-level calculations or other situations 
with low multiplicity, because one still needs to use a large number of cells in order to 
accurately carry out the minimisation. 

The relation between minimisation and jet finding has also been investigated in [51], 
where stable-cone finding (and cone iteration) has been interpreted in terms of the search 
for the local minima of the potential 

Fin) = lj2Ftr ^Rl - ^Rl) , (15) 

i 

which is a function of the particle momenta and of a cone direction n (a coordinate in 
7], 0). Each stable cone corresponds to a local minimum of the potential as a function 
of n. Investigations have also been carried out [98| 199] into whether one can directly use 
"potential" approaches as a replacement for jet finding altogether. 

For completeness, it should be stated that the above approaches are infrared and 
coUinear safe, as an almost direct consequence of the way in which they are constructed. 
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2.4 Recombination schemes 



The most widespread recombination scheme nowadays is the i?-scheme, or 4-vector re- 
combination scheme. To merge two particles, it just adds their 4-vectors (and it produces 
massive jets). This is the current recommendation according to [2T] . 

A scheme that was widely used in the past at hadron-colliders was the Et weighted 
recombination scheme, which had been put forward also in the Snowmass accord. To 
recombine a set of particles into a jet, it uses the following procedure: 



where the sum runs over the particles contained in the jet, and the jet is taken to be 
massless. This procedure has the drawback that it is not invariant under longitudinal 
boosts if the component particles are massive (though one can formulate boost-invariant 
alternatives in terms of rapidity yi and and transverse momentum pu). 

When other recombination schemes are used, this is usually stated explicitly in the cor- 
responding publication. One should be aware that in some cases the recombination scheme 
used during the clustering (e.g. in the iteration of stable cones) differs from the recombi- 
nation scheme that is used to obtain the final jet momenta once the particle assignments 
to the jets are known. 

2.5 Summary 

We have seen many different jet algorithms in this section. A summary of the main ones 
in common use in hadron- collider studies is given in table [2J Many of the algorithms (and 
all the IRC safe ones) are available from the FastJet [H] or SpartyJet |12] packages (the 
latter provides access to the IRC safe algorithms via FastJet). 

A general recommendation is that hadron-collider algorithms that are IR or collinear 
unsafe should in future work be replaced by IRC safe ones, of which the inclusive kt, C/A 
(possibly with "filtering"), anti-kt and SISCone are good choices. Specifically the xC-PR 
class of algorithms is naturally replaced by the anti-A;^ algorithm (which produces circular 
jets, as illustrated in figure [71 and has similar low-order perturbative properties), while 
SISCone is very much like the IC-SM algorithms, but ensures that the stable-cone finding 
is IRC safe. 

Figure [7] illustrates the jets that are produced with the 4 "choice" IRC-safe algorithms 
in a simple, parton- level event (generated with Herwig), showing among other things, the 




(16a) 




(16b) 



(16c) 
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Figure 7: A sample parton- level event (generated with Herwig |101j ). together with many ran- 
dom soft "ghosts", clustered with four different jet algorithms, illustrating the "active" catchment 
areas of the resulting hard jets (cf. section H7i|) . For kf and Cam/Aachen the detailed shapes are 
in part determined by the specific set of ghosts used, and change when the ghosts are modified. 



degree of regularity (or not) of the boundaries of the resulting jets and their extents in the 
rapidity-azimuth place. 

3 Computational geometry and jet finding 

It takes the human eye and brain a fraction of a second to identify the main regions of 
energy flow in a calorimetric event such as fig. [71 A good few seconds might be needed to 
quantify that energy flow, and to come to a conclusion as to how many jets it contains. 
Those are timescales that usefully serve as a reference when considering the speed of jet 
finders — if a jet finder takes a few seconds to classify an event it will seem somewhat 
tedious, whereas a few milliseconds will seem fast. One can reach similar conclusions by 
comparing to the time for a Monte Carlo event generator to produce an event (from tens 
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PxCone 


ICmp-SD 
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cut on cone pt, 


CMS Iterative Cone 


IC-PR 


C0II3+I 
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PyCell/CellJet (from Pythia) 


FC-PR 


C0II3+I 


m 




GetJet (from ISAJET) 


FC-PR 


C0II3+I 







Table 2: Overview of some jet algorithms used in experimental or theoretical work in hadronic 
collisions in the past few years. SRp=a; = sequential recombination, with p = —1,0, 1 characteris- 
ing the exponent of the transverse momentum scale, eq. (fTOl) : SC = seedless cone (finds all cones); 
IC = iterative cone (with midpoints mp, ratcheting r, searchcone se), using either split-merge 
(SM), split-drop (SD) or progressive removal (PR) in order to address issues with overlapping 
stable cones; FC = fixed-cone. In the characterisation of infrared and collinear (IRC) safety 
properties (for the algorithm as applied to particles), IRn+i indicates that given n hard particles 
in a common neighbourhood, the addition of 1 extra soft particle can modify the number of final 
hard jets; Coll„+i indicates that given n hard particles in a common neighbourhood, the collinear 
splitting of one of the particles can modify the number of final hard jets. Where an algorithm 
is labelled with the name of an experiment, this does not imply that it is the only or favoured 
one of the above algorithms used within that experiment. Note that some of the corresponding 
computer codes for jet finding first project particles onto modelled calorimeters. 
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Type of event 



N 



e~^e~ — )• hadrons event on the Z peak 
HERA direct photoproduction (dijet) or DIS 
HERA resolved photoproduction (dijet) 
Tevatron (a/s = 1.96 TeV) dijet event 
LHC (v^ = 14 TeV) dijet event 
LHC low-luminosity event (5 pileup collisions) 
RHIC AuAu event {y/s = 200 GeV/nucleon) 
LHC high-luminosity event (20 pileup collisions) 
LHC PbPb event (v^ = 5.5 TeV/nucleon) 



30000 



40 
40 
60 
200 
400 
1000 
3000 
4000 



Table 3: Orders of magnitude of the event multiplicities N (charged + neutral) for various kinds 
of event. The e+e~ , photoproduction, DIS and pp results have been estimated with Pythia 6.4[1D21 
dOO], LHC PbPb with Pythia + Hydjet [103] and RHIC has been deduced from [IM]. Note that 
experimentally, algorithms may run on calorimeter towers or cells, which may be more or less 
numerous than the particle multiplicity. 

of milliseconds to a fraction of a second), or for a fast detector simulation to process it. Or 
by considering the number of CPU hours needed to process a typical event sample, which 
might consist of O (10^) events. 

The time taken for jet finding by computer codes depends strongly on the number of 
input particles (or towers, etc.), A^. We don't yet know the exact average multiplicities of 
LHC events, but rough estimates are given in table El With the kt algorithm's "standard" 
A^^ timing, assuming about 10^ computer operations per second, one expects a time for 
clustering a low-luminosity LHC event of 1 s (this is also what one finds in practice). 
So this is close to being "tedious," and becomes dissuasive for high-luminosity LHC and 
heavy-ion collisions, or if one wishes to try out many distinct jet definitions (e.g. several 
different R values to see which is best). A more extreme example is the exact seedless 
cone algorithm following the method in [2T], which has a timing of N2'^ . In practice 
(NLOJET-|--|- implementation ^45]), an event with ~ 20 particles takes about a second, 
so one can extrapolate that even just 100 particles will take 10^'' years. This is beyond 
prohibitive. 

To speed up jet finders one may consider the general class of computational algorithm 
that the jet finder belongs to. For instance, all the SR jet finders are examples of "hier- 
archical clustering" , with a range of different distance measures. General solutions to the 
problem were discussed long ago in the computer science literature by Anderberg |105] . 
with a set of rather good solutions proposed by Cardinal and Eppstein more recently 
in |106[ 1107] , which scale roughly as A^^ . 

Generic hierarchical clustering is, however, a broad problem. For example, given three 
"points". A, B C , generic distances are not 'transitive': if A is close to B and B is close 
to C, this does not imply that A is close to C (the reader is encouraged to think up a 
concrete example for the kt distance measure). On the other hand, jet finding often has 
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a geometrical component (since, at hadron colliders, the rapidity and azimuth coordinates 
represent the surface of a cylinder). In geometry, if A is close to B and B to C, then A and 
C are necessarily also close. This is of significant help, and a whole research field exists for 
such geometric proximity problems, computational geometry. Section [3?T] will show how we 
can make use of this to obtain In scalings for the kt algorithm rather than the A^^ of 
generic hierarchical clustering, or A^^ of the older /c^-clustering codes. Then in section 13.21 
we will examine how to apply computational geometry to cone algorithms. 



3.1 Sequential recombination algorithms 

The original implementations of the kt algorithm [TUl [TT] set up a two-dimensional array 
of the dij, and at each stage of the clustering run through all entries of it in order to find 
the minimum, and then update the array with the entries for the newly created particle. 
Since the dij array is of size O {N"^) and the minimum is searched for O (N) times in total 
(i.e. O (N) clusterings), these implementations take a time ~ A^^. 

We have seen briefly above that there exist generic methods for hierarchical clustering, 
i.e. repeated recombination of the closest pair of objects, that take A^^ time. In general A^^ 
time is a lower bound because, at the very least, one has to consider all entries of the dij 
distance matrix in order to find the smallest. One may then be clever in keeping track of 
distance information as points are recombined, so as to side-step the A^^ growth of some 
kt algorithm implement at ions but the initial search for the minimum among all pairs of 
points seems unavoidable. 

3.1.1 kt algorithm 

To see whether we can evade the A^^ bound, let us examine the kt algorithm's distance 
measure in more detail 

d,, = mm{pl,pl)ARl, = (^^ _ y^y + (0^ _ 0^.)2 . (17) 

We could equally well have considered a distance measure 

A, =P?.A4, (18) 

The smallest of the Dij across all i,j coincides with the smallest of all the dij, since 
mm{Dij, Dji) = dij. So it is irrelevant whether we use eq. (fT7|) or (fTSjl in the kt algorithm. 

Eq. f|T8l) has the important property that the transverse-momentum part depends on 
just one of the two particles, so we can write 

min{A,}=Pi.min{A4}, (19) 



'Essentially, observing that at most O {N) distances change at every pair recombination. 
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i.e. fixing the smallest of the Dij involves i's geometrical nearest neighbour (let's refer 
to it as Qi). So if we can find some efficient way of establishing and tracking that geomet- 
rical information, then rather than finding the minimum of iV^ Dij values, the sequential 
recombination problem involves only finding the minimum of Dig. values. This was the 
key observation of |108] . 

One is then left with the question of how to find the minimum of the Ai??j for each i, 
since this still seems to involve a total of N"^ points. Technically, the problem is that of 
establishing and maintaining a nearest-neighbour graph on the 2-dimensional surface of a 
cylinder. A rule of thumb when faced with such problems is to first ask how one might 
deal with them in 1 dimension, say rapidity y. That is easy: one sorts the points according 
to their y coordinate, and the nearest neighbour of a point is the one that immediately 
precedes or follows it. 

Let us do the bookkeeping for this case with just a rapidity coordinate: 
Initialisation: 

• Sort points according to y coordinate (with a balanced binary tree), find nearest 
neighbours, and find all dig.. [NlnN] 

• Place the dig. in a "priority queue" (a structure for efficient minimum-finding and 
updates; often simply a balanced binary tree) [N In N] 



• Recombine the pair with smallest dig^, remove the corresponding two points from the 
rapidity-sorted tree, add the new one, establish the new point's nearest neighbours 
and establish if it has become the nearest neighbour of any of the existing points. 

[In N per recomb.] 

• Update the priority queue of dig^ values and find the new minimum (only a finite 
number of d^g. will change per round). [In N per recomb.] 

This gives a total time of O {N In N) . To understand the origin of the In N factor, observe, 
for example, that if you organise N objects into a binary tree structure, then the depth 
of the tree will be ln2 N (equivalently, given k levels to the tree, it can contain up to 
S^=o 2^ = 2^ — 1 objects). Any operation such as adding or removing an entry in the 



tree involves working through the depth of the tree, and so paying a price O (In A^). Since 
building the tree can be seen as adding A^ objects this costs O (NlnN) time. 

With two geometrical dimensions, nearest-neighbour finding is more complex, however 
it has been the subject of research by the computational geometry community. One struc- 
ture that can help is the Voronoi diagram [T09j . or its dual, the Delaunay triangulation. 
A Voronoi diagram divides the plane into cells (one per vertex), such that every point 
in the cell surrounding a vertex i has i as its nearest vertex. The structure is useful for 
nearest-neighbour location because the vertex Qi nearest to vertex to i is always in one of 



Iteration: 
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Figure 8: The Voronoi diagram for ten random points. The Delaunay triangulation (dashed, 
red) connecting the ten points is also shown. In this example the points 1, 4, 2, 8 and 3 are the 
'Voronoi' neighbours of 7, and 3 is its nearest neighbour. Adapted from |108j . 

the (few, expected^ O (1)) cells that share an edge with the cell of vertex i. An example 
is shown in figure [HI 

Voronoi diagrams for points can be constructed with O {N In A^) operations |110] . 
Maintaining dynamic point sets is more complicated, however there exists an approach [lllj 
that takes O {NlnN) for the initial construction and expected In A^ per insertion/deletion 
and it is available as a public code, CGAL \11'2\ 1113] . 

A complete expected A^lnA^ implementation that makes use of CGAL is available in 
the Fast Jet program [H]. In practice, InA^ terms in computational geometry come with 
a large coefficient — typically a In A^ term might be smaller than a term linear in A^ only 
for A^ > 10^. Therefore for moderate A^ it is useful to include alternative computational 
strategies. One particularly successful one (optimal in the range 50 < A^ < 10^) makes use 
of the fact that only for ARij < R will a dij distance be smaller than diB, djB in eq. ([6]) — 
therefore one can restrict one's search for i's geometric nearest neighbours to the region 
within R of i. Denoting by n the typical number of points in such a region, one then has 
a O (Nn) algorithm. 

Further details are available in the Fast Jet documentation (from the web site [H]) and 
from an unpublished preprint |114j . 

3.1.2 Special cases 

The above approaches can be used for the whole class of generalised (longitudinally invari- 
ant) kt algorithms, however some special cases deserve comment. 

Cambridge/ Aachen. For the C/A jet finder, there is no momentum scale, so rather 
than having a dynamic planar nearest-neighbour problem, one has a dynamic planar closest 
pair problem, dij = AR^j. This is simpler — essentially one can maintain a nearest 
neighbour candidate for each point ^but it only need be correct for the closest pair. One 
remarkable solution to this probleno was given by Chan in [115] . It is included natively 

^''^ "Expected" means that there can be special cases where the number is parametrically larger. 
^^Which can be stated in a paragraph, though this does not mean that it is simple to understand! 



34 



in FastJet, and is slightly faster than the CGAL based solution (as well as avoiding the 
need for a separate package). 

Note: Chan's solution relies on the use of integer arithmetic (part of its cleverness lies 
in its implicit use of the binary representation of integers). However since rapidities and 
azimuths do not extend to large values, one can safely rescale them by some large constant 
and represent them as integers. 

Anti-kt. The generalised kt algorithm with p < (and specifically anti-fct, with p = —1) 
has the property that it effectively produces jets that grow outwards in a circular pattern 
around a high-pj seed. This leads to configurations with one particle at the centre of a circle 
and many on the edge (the first layer of points on the edge contains O {\/n) particles). 

This is precisely the configuration in which the 'expected' A^lnA^ behaviour of the 
clustering breaks down: the central point has many Voronoi neighbours, and, furthermore, 
is involved in each clustering (so it is removed, reinserted, and then one must go around 
all the points on the edge to see which now is its nearest neighbour). This means that for 
very large N,n, the timing for anti-/cj type algorithms is closer to Ny/n than to A^lniV. 

According to |116] there exist approaches to the planar nearest-neighbour problem 
that have worst-case behaviour N"^ with arbitrarily small e, however these have not been 
investigated in the context of jet finding. 

3.2 A polynomial- time seedless cone 

We saw in section 12.1.41 that the use of particles as seeds, i.e. starting points for cone 
iterations, gets us into trouble with IRC safety: if one finds jets based on the ordering 
of the seeds in pt, then one is sensitive to coUinear splittings; if one uses the stable cones 
obtained from iterating all seeds, then one becomes sensitive to the addition of new soft 
seeds. 

We also saw an exact seedless approach that takes all subsets of particles and establishes 
for each one whether it corresponds to a stable cone — i.e. for each subset one calculates 
its total momentum, draws a circle around the resulting axis, and if the points contained 
in the circle are exactly those in the initial subset, then one has found a stable cone. This 
is guaranteed to find all stable cones. 

For large multiplicities, this is inherently wasteful insofar as most of the 2^ subsets of 
particles don't fit into a circle of radius R on the rapidity-azimuth plane, so there is no 
way for them to form a stable cone. 

The obvious corollary of that observation is that one should only consider subsets of 
points in which the members of the subset are contained within a circle of radius i?, and 
any point not in the subset is outside the circle. It is only these subsets that can ever form 
a stable cone. Therefore rather than considering all subsets of points, one can restrict one's 
attention to all distinct ways of separating points on the surface of a cylinder (or plane) 
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Figure 9: Representation of points on a line and the places where a sliding segment has a change 
in its set of enclosed points. 




(c) 





Figure 10: (a) Some initial circular enclosure; (b) moving the circle in a random direction until 
some enclosed or external point touches the edge of the circle; (c) pivoting the circle around the 
edge point until a second point touches the edge; (d) all circles defined by pairs of edge points 
leading to the same circular enclosure. 



into two subsets, those points inside a circle of radius R, all others outside — a "planar all 
distinct circular enclosures" problem. 

This problem is a clear example of a computational geometry problem. Let's first see 
how we would deal with it in one dimension, for which a "circular enclosure" just reduces 
to a line segment enclosure. Given points on a line and a segment of length 2R, we can 
order the points, place the segment to the left of the leftmost point, and then slide it 
sideways. Each time the left or right edge of the segment touches a point, the contents of 
the enclosure change. The cost of finding all enclosures is just that of ordering the points 
{N\nN). 

How do we extend this to two dimensions? The central idea is that the enclosed 
point set changes when a point touches the enclosure. In Id we can always shift the 
enclosure, without changing its contents, until its edge touches a point (either in or out 
of the enclosure). In 2d we can first shift the circular enclosure until one point touches 
the edge, then pivot the circle around that point until its circumference touches a second 
point (fig. [TU]) . Conversely if we consider all pairs of points (within 2R of each other) and 
draw all possible circles that go through those pairs, then we will have found all possible 
enclosures (one should remember that edge points can be either in or out of the enclosure; 
special treatment is also needed for points that are alone, i.e. further than 2R from the 
nearest other point). 

There are O (Nn) relevant pairs of points (recall, cf. the end of section I3.1.1[ that n is 
the number of objects in a region of area ~ i?^)0 One could directly check the stability 

There is a correspondingly large number of distinct cones, and this has implications for proposal [1171 
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of the cones defined by each pair as follows: one sums the O (n) momenta contained 
within a given cone and then checks to see whether a new cone centred on the direction 
of the resulting momentum contains the same set of points. This would give an O [Nrt^) 
algorithm. Alternatively one can establish a traversal order in which the circle contents 
change by one point at a time, avoiding (with the help of a few other tricks) the need 
to pay a price of O (n) for the stable-cone check for each distinct enclosure. This is the 
basis of the (expected) Nn In n algorithm that is known as SISCone (Seedless Infrared Safe 
Cone) |in]0 

Some comments are due concerning SISCone's timing. There are usually only O (N) 
stable cones, parametrically fewer than the number of distinct enclosures. Might there 
be a way of somehow skipping all the unstable enclosures? It is not clear, because the 
upper bound on the number of stable cones is actually O {Nn) (the much lower expected 
value holds for random point sets [40j). This worst case can actually occur (for example 
with regular sets of "ghosts," cf. section with implications then for the split-merge 
procedure. Normally the split-merge procedure is significantly faster than the stable- 
cone search, in that it takes time O {N'^) (<^ A^nlnn in practicJ^ in SISCone's fairly 
straightforward implementation. However if the number of stable cones is 0{Nn), then 
the split-merge step becomes O (N'^n) unless one applies additional dedicated techniques 
(such as quad-trees or k-d trees, as discussed in [10] and also suggested in |117] ). 

A further comment is due on memory usage: SISCone maintains a hash of circular 
enclosures that it has already seen (and whether they are candidates for stable cones or 
not). That hash has as many entries as distinct enclosures, O {Nn), and this can become 
problematic for very large multiplicitiesjlf] In such cases one could in principle reduce the 
memory use to 0{N), at the expense of a slower run-time O (A^n^/^), but this has not 
been implemented. 



Wl\ to use a Fast Fourier Transform for the stable-cone search (cf. also the FFT Jet package "M" , which was 
released just as this review was being finalised), essentially because it implies the need for a Fast Fourier 
Transform grid of size O (Nn) . 

^ ''It has been pointed out by Sjostrand |118) that SISCone's use of all pairs of points to provide the full 
list of distinct circular enclosures bears a close relation to a technique used in the computation of the 
Thrust jll9j . There, all pairs of particles are used to generate all relevant separations of the surface of a 
sphere into two hemispheres. A corollary of this observation is that SISCone's idea of a traversal order 
could also be used in the context of the thrust, to reduce its computation time to N'^ \nN. 

^^In QCD events and with typical values of the jet finding radius R = 0.4 — 1.0, N/n is usually between 
10 and 100; in 2-dimensional problems, for the multiplicities that are of relevance here, a rule of thumb 
seems to be that Inn is very roughly equivalent to a factor of order 10'^. 

^^If we suppose n ~ O.IA^, and that each entry needs 12 bytes for the hash, two double-precision numbers 
(8 bytes each) to describe the center of the cone, and a pointer to the next hash element, plus various 
overheads, then we get a memory usage of about AN'^ bytes, i.e. nearly 4 GB for N ~ 30 000, which is a 
typical expected LHC heavy-ion multiplicity. 
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Figure 11: Timings for the clustering of a simulated ~ 50 GeV dijet event, to which increasing 
numbers of simulated minimum-bias events have been added (both simulated with Pythia). In 
dark colours one sees SISCone and the FastJet kt, anti-A;i and Cambridge/ Aachen implemen- 
tations. For kt (anti-fcf), the kink at ~ 25000 (A^ ~ 50000) signals the point where FastJet 
switches between Nn and A^ln A^ {Nn^^"^) strategies. In grey one sees results for the KtJet imple- 
mentation [7l] of the the kt algorithm, the Midpoint cone (ICmp-SM) in CDF's implementation 
(with and without a 1 GeV cutoff on seeds) and the JetClu iterative cone (IC^-SM, with a 1 GeV 
seed threshold). All non- Fast Jet algorithms (except KtJet) have been accessed through FastJet 
plugins. 

3.3 Speed summaries 

Statements of timings in terms of their scaling with A^ can hide large coefficients and 
significant preasymptotic corrections. Another issue is that as A^ increases so does memory 
usage, requiring (slower) access to the main memory rather than the CPU cache, and this 
too can affect practical timing results. 

Timings for a subset of commonly used algorithms are shown in fig. [TTl One conclusion 
from that figure is that SISCone, the slowest of the IRC safe algorithms, is still competitive 
in speed with the main public Midpoint-cone code and is acceptably fast unless one goes 
to A^ larger than several thousand. FastJet's implementation of the kt algorithm (and C/A 
and anti-kt) is much faster, with clearly healthier scaling at large A^, and it beats even the 
fast IRC unsafe cone codes, like CDF's JetClu. 

The FastJet curve has a clear kink at A^ ~ 15000. This is the point where FastJet 
switches from an Nn (tiled) algorithm to the A^ In A^ CGAL-based one. One can explicitly 
verify that the CGAL-based algorithm does have an A^ In A^ behaviour, by dividing the run 
times by A^ and plotting the result. This is shown in fig. [T2j 

To conclude this part, we have seen how the computational geometry aspect of jet- 
related problems can be exploited to help resolve many of the practical computational 
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Figure 12: Verification of the A^ln timing behaviour of the CGAL-based implementation of 
the kt algorithm in Fast Jet. The timings are divided by N so as to highlight the remaining In 
dependence. Taken from [114j . 

issues that arise if one is to carry out infrared-safe hadron-collider jet finding. It is probably 
fair to say that this is playing a crucial role in encouraging the LHC experiments to switch 
to QCD-compatible jet finders. In particular, all the LHC experiments now incorporate 
Fast Jet and its SISCone plugin in their software frameworks (and ATLAS also has its own 
~ A^^ implementation of the sequential recombination algorithms) [TBI 11^0] ■ 

4 Understanding jets 

Ideally, one would really like to be able to measure partons in experiments. Jets are 
the closest, physically, that we get to partons. How close are they exactly? And what 
about the fact that a "parton" isn't actually a well-defined concept in the first place? 
An understanding of these questions is part of the key to knowing how best to use jet 
algorithms at colliders, both in terms of choosing which algorithm to use and setting its 
parameters. 

The precise issues that one might investigate fall into various categories. For example, 
how broadly will a jet reach for its constituents (section l4.ll) ? This information is impor- 
tant in terms of one's ability to disentangle different partons in heavy-particle decays (for 
example hadronic tt events, which decay to 6 hard partons). Often, one will use jets to 
reconstruct the kinematics of some 'parent object' (again, a heavy particle that decayed; 
or, in inclusive-jet measurements, a scattered parton from the incoming proton). How does 
the jet's energy relate to that of the 'parent object'? This is affected both by perturbative 
(section [4. 2p and non-perturbative (section [4.31) radiation. Finally LHC is special in that it 
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will have significant underlying-event activity (maybe 15 GeV per unit rapidity) and even 
larger pileup (easily 100 GeV per unit rapidity). How do jets react to this (section 

It is probably fair to say that our understanding of all these questions is still incomplete. 
But the material below outlines some of what we do know. 



4.1 Reach 

4.1.1 Two-particle case 

Given just two massless particles, separated by a distance Ai? on the y — 4> cyhnder, will 
they be recombined into a single jet? This is the simplest of the questions one might ask 
about a jet definition's reach. It was discussed in [29] for the inclusive kt algorithm and 
for a partially specified cone, which behaves somewhat like SISCone. 

It is convenient to take the transverse momenta of the two particles, pa, pt2 (defined 
as the softer one) to be related by 

Pt2 = xpn , a; < 1 . (20) 



Writing the sum of the two particles' momentum as pj, with pu = Pti+Pi2CJ and imagining 
the two partons as coming from a common ancestor, we can also write 

Pn = {l-z)ptj, Pt2 = zptj, (^"T^^0- 

According to the context, results are more simply expressed either in terms of x or z, which 
is why it is useful to introduce both. 

In the kt algorithm, the two particles will form a single jet if du < diB, ^25 (as defined 
in eqs. ([2])), or equivalently if 

ARi2<R. (22) 

In the case of an algorithm like SISCone, based on stable cones, the question is whether 
particles 1 and 2 can both belong to a single cone. This happens if both are within R of 

ARu = zARu 
AR2j= (l-^)Ai?i2 



<R, (23) 



where the relations between ARij and AR12 follow from yj = (1 — z)yi + zy2, 4>j = 
(1 — z)(j)i + z(f)2- Since we have defined z < 1/2, it is the lower condition of eq. ( 123|) that 
is more constraining and it leads to 

ARi2<{l+x)R, (24) 



^"in the widespread 4- vector (i?-scheme) recombination scheme, this is exact only if the particles are very 
close in angle. However it remains a good approximation even for Ai?i2 ^ 1 and so it is an approximation 
that will recur in this section. 
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i.e. the same as the kt algorithm condition, eq. ( 12^ . when p2 is soft, x ^ 1, but reaching 
twice as far when pt2 — Pti ■ 

The conditions eq. f l2^P^ basically account for the behaviours of nearly all jet algo- 
rithms: 

• eq. (122|) holds for the kt algorithm, as well as Cambridge/ Aachen, anti-kt, and the 
xC-PR cone algorithms; 

• eq. dSlD holds for IC^p-SM ("midpoint"), IC^p-SD (PxCone) and SC-SM (SISCone) 
algorithms; 

• IC-SM algorithms without midpoint seeds (JetClu, Atlas Cones) have ill-defined 
behaviour. For just two particles, they lead to a single jet based on eq. (122|) . but 
if additional soft seeds are present then this transforms into eq. (12^ . This is a 
manifestation of their infrared unsafety. 

4.1.2 General case 

The complexity of tracing the behaviour of a jet algorithm precludes general results about 
the reach of different jet algorithms for multi-particle configurations. One question that 
has however seen some attention is that of how the results of section 14.1.11 get modified in 
the presence of parton showering and hadronisation. 

This is a delicate question because to answer it one has to know something about the 
environment that created the two partons: did they come from the branching of a single 
parent quark, or a parent gluon, or even the decay of a colour-singlet particle? And what 
was the parent parton colour-connected to? All of these issues relate to the fact that there 
is no rigorous way of defining partons in the first place. Furthermore, even in a probabilistic 
Monte-Carlo type approximation, the way partons shower and hadronise depends on the 
environment. 

One approach [361 1121] to the question involved superposing pairs of events and estab- 
lishing under what conditions jets that had been identified in the individual events became 
a single jet if one applied the jet finder to the two events combined together. This study 
was performed for IC-SM type algorithms and came to the conclusion that the individual 
jets were merged if there were within 1.3-R of each other 1^ This corresponds roughly to 
expectations based on eq. (12^ . if one performs some reasonable averaging over the jet 
momenta, i.e. x (indeed we will see the value 1.3 appear again below in section 14.2.11) . 
However it fails to provide a direct link with the x-dependence of eq. fl24p . 

^-•^The value 1.3 has also inspired a practice in NLO calculations, still current within the CDF collabo- 
ration (e.g. |122| V that involves placing an artificial cut on the separation between partons within a jet at 
AR — Rsep X R, with Rsep = 1-3. Such ad-hoc modifications of the jet algorithm used in a theory predic- 
tion defeat the purpose of a NLO calculation. It is probably fair to say, however, that in most contexts 
where Rsep has been used, its impact is smaller than the dominant theory and experimental uncertainties. 
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Another approach to testing eqs. fl22|24p was taken in [10]. There, jets were initially 
found with a "reference," R = 1 hierarchical-clustering algorithm. The hierarchy was used 
to identify the two main subjets, 5*1 and 5*2, within the jet. Each event was also clustered 
with a test algorithm T, with Rt = 0.4. The test algorithm is the one whose clustering 
behaviour one wishes to probe and need not be the same as the reference algorithm (and in 
our tests will often not be). One then looked to see if there was a jet with algorithm T that 
contained at least half of the pt of each of 5*1 and 5*2. If there was, the conclusion was that 
the two subjets had ended up (dominantly) in a single jet from the test algorithm. The 
procedure was repeated for many events and one could then plot the fraction of /cj-algorithm 
jets for which this occurs, P2^i{AR, function of the distance AR = Ai^^^^j and 

the momentum ratio x = Pt,S2/Pt,S2- 

In ref. |l0] it was the kt algorithm that was used as a reference. Here we shall instead 
use the C/A algorithm as a reference and the subjets are the two objects whose merging 
in the reference jet's clustering sequence involves the largest kf distance (i.e. the hardest 
merging) @ The results will be based on dijet events simulated with Herwig 6.5 |1U1] . both 
at parton and hadron levels (the latter including Herwig's default soft UE). Reference jets 
were required to have pt > 50 GeV. 

Results are shown in fig. [131 for various jet algorithms T, with parton-level results on 
the left and hadron level on the right. In the regions in black, the two C/A subjets always 
end up in a common T-jet, while in the region in white this does not occur. For the IR-safe 
algorithms one sees rough agreement with the expectations from eqs. fl22f24p . though for 
SISCone the boundary is quite broad and shifted to the left of AR/ Rj- = 2 aX x = 1. This is 
probably partly a consequence of the showering and hadronisation, which limit the stability 
of configurations in which the two subjets are near opposing edges of a cone, as has been 
extensively discussed in [51]. The dependence of the effect also on the split-merge overlap 
threshold / suggests that the split-merge dynamics have a non-trivial impact as well. 
Fig. Hn] also shows results two very IR unsafe algorithms, a plain IC-SM variant (CDF's 
Midpoint algorithm with the midpoint option turned off) and JetClu. Among the relevant 
features, one notes the somewhat different shape for the IC-SM algorithm at parton and 
hadron level, most visible if one examines the contours (dashed lines), especially at higher 
X values. This is a consequence of the IR unsafety. JetClu bears little resemblance to the 
plain IC-SM algorithm, even though it is IC-SM based. More detailed study reveals that 
this is only partially due to its use of ratcheting. 

4.2 Perturbative properties, pt and mass 

Gluon radiation is inevitable from fast-moving partons. How does it affect the properties of 
a jet? Basically the gluon may be radiated beyond the reach of the jet definition ("splash- 
out") and thus reduce the jet's energy compared to that of the parton. Alternatively it 

^^This is inspired by the use of a kt and an angular distance measure in the original Cambridge algorithm, 
and gives clearer results than the kt algorithm's subjets. 
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Figure 13: Shade/contour plot of the probability for two C/A subjets to each have at least 
50% (blue, short-dashed contour; 25% and 75%: red dotted, green long-dashed contours) of their 
transverse momentum within a single test-algorithm (T) jet. Shown based on a sample of dijet 
events simulated with Herwig 6.5 at parton-level and at hadron-level (with Herwig's default soft 
UE), using all C/A jets with pt > 50 GeV. C4Sried out for Tevatron Run II conditions, pp at 
y/s = 1.96 TeV. 



may be radiated within the reach of the jet definition and then generate a mass for the 
jet (assuming a 4- vector-addition recombination scheme). The aim of this section is to 
give some simple analytical understanding of the effect of perturbative radiation on a jet's 
transverse momentum and mass — rules of thumb — as well as references to the literature 
for more detailed analyses. 

For the reader who is interested principally in the results, the two main ones can be 
summarised as follows. For small jet radii, i? ^ 1, the average fractional difference between 
a jet's transverse momentum and that of the original parton is 

(Ptjet - Pt,parton)pert _ quarks: -0.43 1 1^ 1 , ^ 

- gluons: -1.02 J >< ^ + u [a^) . [^^) 

where the O (a^) term depends both on the jet algorithm and the global environment in 
which the parton is to be found (e.g. colour connections to other partons) and is often 
ill-defined because of the ambiguities in talking about partons in the first place. Ignoring 
these important caveats, the above result implies that an R = 0.4 quark (gluon) jet has 
about 5% (11%) less momentum on average that the original parton (for = 0.12). 

The second result is that the average squared jet mass for all non-cone algorithms is 

For both the pt loss and the squared jet mass, SISCone results are similar to kt, anti-Zcf 
and C/A results when -RssiCone — 0.75Rkt. 



4.2.1 Jet Pt 

In many uses of jets, one needs to know how a jet's energy (or pt) relates to the underlying 
hard scale of the process — for example to the mass of a decaying heavy particle (top 
quark, Higgs boson, new particle), or to the momentum fraction carried by a scattered 
parton in an inclusive jet cross section. 

One approach to this is to take a Monte Carlo event generator, let it shower a parton 
from some source and then compare the jet's pt to that of the parton. This often gives a 
reasonable estimate of what's happened, even if the Monte Carlo basically acts as a black 
box, and brings a somewhat arbitrary definition of what is meant by the initial "parton" 
(or of the mass of the top quark). 

Another approach is to take a program for carrying out NLO predictions, like MCFM [H] 
or NLOJET-|--|- ^5j, and for example determine the relation between the jet pr spectrum 
and the parton distribution functions. NLO calculations are perhaps even blacker boxes 
than Monte Carlo generators, on the other hand they do have the advantage of giving 
predictions of well-defined precision; however, one loses all relation to the intermediate 
(ill-defined) "parton" (this holds also for tools hke MC@NLO gH] and POWHEG [SD]). 
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Some insight can be obtained from analytical NLO calculations of jet cross sections, 
such as |123[ I124[ I125[ I126[ 1127] . A feature that is common to them is that at the first 
non-trivial order, cross sections acquire a Ini? dependence in the small- i? limit. The 
small- i? limit is one case where one can say something meaningful the relation between 
a jet's Pt and that of the original parton (another is the threshold limit, for example 
|128t 1129^ ll3Ut I131t 1127] ). because the emitting parton decouples from its environment, a 
consequence of angular ordering. Working in a collinear approximation and considering 
an initial quark, with a gluon emission matrix element proportional to the real Pqq{z) 
splitting function {Pqq{z) = Cf{1 + z^) / {1 — z)) , one can simply write the average difference 

^Pt = Pt,iet - Pt.quark aS 



(Spt) 



pert 



l^jdz P.(max[.,l-.]-l)^ a,{eil^^z)pt) p^^(^) _ ^^^^^^^^^ 



(27) 

where one integrates over the angle 6 between the quark and an emitted gluon and over 
the momentum fraction z that is kept by the quark, weighting the matrix element with the 
loss of momentum from the leading jet, pt(max[2;, 1 — z] — 1), when the gluon and quark 
form two separate jets, 9 > fi,ig{z)R (throughout this section, 9 is to be understood as a 
boost-invariant angle, 9 = ARqg). The quantity faig{z) reflects the algorithm's reach, cf. 
eqs. fl22ll24p and is given by 



1 kt, C/A, anti-fc; 



fi^) = <. , . . . 1-.. (28) 



1 + min(-^, i^) SISCone 

■1—2' 2 ' 



Carrying out the integration in a fixed-coupling approximation gives 

{Spt)pcrt _ Ols 



Pt vr 



Li\nR + 0{as), i? < 1 , (29) 



with Li a coefficient that depends on whether it is a quark or a gluon that is the initiating 
parton (cf. [132]): 

= ^21n2- ~ l.OlCp, (30a) 

/ 43\ 7 

Lg = CAh\n2- — \+njTR—c^ QMCa + O.lST^n^ , (30b) 

One notes that for small R the result of eq. ( 129|) is negative. The unspecified pure O (os) 
term refiects the result's dependence on the large- angle environment. It can be defined 
unambiguously only in the threshold limit. Neglecting it, one comes to the conclusion that 
with R = 0.4, a quark-induced jet has, on average, a pt that is about 4 — 5% smaller than 
the initiating parton, while a gluon jet's pt is 8 — 10% smaller. 
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One can also evaluate the small- limit of the average difference between the pt of a 
SIS Cone jet and (say) a /^t jet (again following |132] ) 



Pt TT 



with 



Kg = (^-^ + ^ In 2 + In^ 2^ Cp ^ 0.323C^ , (32a) 

/ 1321 133 \ /241 25 \ 

ir, = (^-^ + — In 2 + In^ 2 j + - - In 2 j UfTn ^ 0.294^^ + O.OSTn^r^ . 

(32b) 

Numerically, Ki ~ O.SLi, or equivalently the average behaviour of SISCone and the kt 
(and related) algorithms are similar perturbatively when Ini?^^ ~ 0.3 + Ini^siscone, that 
is Rkt — 1.35 -Rsiscone- This feature was originally observed for a generic cone algorithm 
in ESI. 



4.2.2 Jet mass 



Partons (except for heavy quarks) are essentially massless. Jets, in particular those with 
significant substructure, are not. Jet masses are interesting in part because hadronic decays 
of very high-pt top quarks and electroweak bosons will be coUimated by the Lorentz boost 
factor and so form a single jet, whose invariant mass might provide a means to identify 
the origin of the jet (cf. section ES])- 

The simplest quantity to examine is the mean squared invariant mass of a jet. This was 
studied in a hadron-coUider context in [22] and it was pointed out that to first non-trivial 
order, 

(M^) ^ C . ^p]R' , (33) 
vr 

where C is a coefficient that depends on the relative fraction of quarks and gluons and on 
the type of jet algorithmic This is easily derived in the small- limit, e.g. for a quark- 
induced jet: 

(M^)pert - / ^ / ■ pU^^- P,,{z)Q{U,{z)R - e) , (34) 

M2 

In a fixed-coupling approximation, the results can be summarised as 

Cl' = \cp C'; = ^^Ca + YQ^fTn , (35) 

^•^Results for jet masses at O {as) are sometimes quoted as being NLO. It would be more accurate to 
state that they are LO results, since O (as) is the first order at which the mass is non-zero. True NLO 
results would go up to O (a^) . Jet masses can be calculated to NLO in dijet events using the 3-jet NLO 
component of a program like NLOJET+-I- |45) . 
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for /cf-like algorithms and 



7 3 



C^'^ = Cf - -\n2 j ^ O.nCp , (36a) 

Cf^ = C^A(^-21n2^ +njTn(^ln2-^^ ^ 0.66C^ + O.lln^T^ , (36b) 

for SlSCone type algorithms (consistent with the observation that kt and SlSCone type 
algorithms behave similarly for — 1.35 i?siscone)- These results coincide roughly with 
the rule of thumb given in [22] that to within 25%, ^^/{M^ ~ 0.2Rpt, with the exact 
value depending on the mix of quarks and gluons, and subject also to finite-i? effects as 
well as threshold modifications for high jet transverse momenta. Ref. |22] also emphasises 
that eqs. (136|) will be subject to significant higher-order corrections, associated with the 
fact that SISCone's effective clustering reach is somewhat smaller for 2 ~ 1/2 than the 
two-parton reach f{z = 1/2) ~ 2, cf. fig. [T31 This is a point of some importance, and 
one should never lose sight of the fact that all the results given above are based on LO 
perturbation theory and can be quite noticeably affected by higher-order terms. 

When using jet masses for tagging hadronically-decaying boosted heavy objects, it 
is also of interest to know the distribution of the jet mass. At leading order, da/dM"^ 
diverges with a logarithmic enhancement ~ In for small masses (cf. the analytical 
result in |133] ). Higher order terms are enhanced by further powers of In Rpt/M and 
can in principle be resummed. Analytical results exist however only for certain specific 
cases in e~^e~ |134[ I135[ 1136] and DIS |137j and have not been extended to hadron-collider 
jets, in part because of issues such as the non-trivial process dependence [13811139] and jet- 
algorithm dependence |140[ 1141] of soft logarithms associated with delimited ( "non-global" 
[136[ I142[ 1143] ) regions of phase-space. 



4.2.3 Other properties 

Many other properties of jets can be predicted perturbatively. Among them one may 
mention the scale associated with subjets within a jet [27 t[65lfT^ . multiphcities of subjets 
|145[ 1146] and of particles (see e.g. |147] and references therein) and jet shapes [39j (radial 
moments and the fraction of energy that is within a certain central core of the jet). As 
well as providing important handles on our understanding of the QCD dynamics within 
jets, these observables can also be useful for example in discriminating quark and gluon 
jets. One such application is given in [68] . 



4.3 Hadronisation 

The properties of jets are affected not just by perturbative radiation, but also by \ow-pt, 
non-perturbative effects. It is useful to divide such effects into two classes: hadronisation 
and the underlying event. Hadronisation corresponds to the transition between partons and 
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hadrons, and occurs in all high-energy QCD processes {e~^e~ , DIS and pp). The underlying 
event (UE) consists of the multiple \ow-pt interactions that occur between the two hadron 
remnants in a. pp or a. resolved jp collision. Physically, in a pp collision, hadronisation 
and the UE cannot be unambiguously separated (the question of what hadronises depends 
on what has interacted). Nevertheless it is useful to consider them separately, because 
they affect jets in rather different ways. Hadronisation is discussed here, and the UE in 
section 14. 4[ 

With current techniques, the impact of hadronisation cannot be calculated (or even 
easily defined) from first principles. However, in the mid 1990's, methods were developed 
|148[ 11491 1150[ I15H 11521 1153] (reviewed in |154] ) that allowed one to predict the main fea- 
tures of hadronisation, based on ambiguities that arise in perturbative calculations related 
to the Landau pole. 

A somewhat oversimplified statement of the idea is that if a perturbative calculation 
involves an integral over asifJ^), then one can estimate the size of the non-perturbative 
contribution by replacing as{fi) with a purely non-perturbative piece af^(/i) = A5(/i — A) 
where A is commensurate with the Landau scale. So, for example, to estimate the non- 
perturbative correction to a quark jet's transverse momentum in the small- i? limit one 
takes eq. fl271) and writes 

— j rf4max[z,l-^]-l)^^^^— ^i^P,,(^)e(^-/aig(^)i?), 

(37a) 



2CVA 



(37c) 



where in the second line one makes use of the knowledge that the 5-function will select 
1 — z = A/{6pt) ^ R. For gluon jets the result is the same except for the replacement 
Cp — Ca- a crucial idea in calculations such as eqs. (1371) is that one can apply the 
sameprocedure to a wide range of observables and the same value of A should hold for 
eacho This is known as "universality" . Universality has been investigated in some detail 
for event shapes in e~^e~ and DIS collisions and there is some debate as to just how well 
it works (e.g. [155] as compared to [156] ). However for the purpose of understanding the 
essentials of the hadronisation of jets it is probably an adequate assumption, and one can 
take A ~ 0.6 GeV. 

The basic result that hadronisation removes transverse-momentum O (A/R) from a jet 
was presented in [148] (and could be deduced from the results of |39j; hadronisation as 
a shift in pt was also discussed in |157j ). Aside from the quark/gluon jet difference it is 



^^As long as they all share the same pt-dependence in the infrared — a less oversimplified formulation 
of the idea in eq. ([37]) is that observables with the same IR pt-dependence are all sensitive to a common 
moment of the coupling in the infrared. 
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a process-independent result, as long as R is much smaller than the angle between jets. 
It seems, however, that this result had largely been forgotten until the advent of a more 
recent calculation |132] , which goes beyond the small R limit (in a threshold approximation 
for dijet production). As an example, the result for the case of the qq' — )■ qq' subprocess of 
dijet production is 



(38) 



A feature of eq. fl38p is that the first correction to the 1/R term is fairly small even for R = 1 
(less than 20%). Consequently for most purposes it is adequate to take just the 1/R piece. 
This is what is done in fig. [TH whose lower set of curves in each quadrant compares the 
hadronisation correction as deduced from Herwig (solid lines) and from Pythia (dashed 
lines) with the 1/R part of the analytical expectation given above (dot-dashed lines). 
Generally speaking the agreement is good, even in the large- i? region where the 1/R 
approximation might be expected to break down. 

To obtain a closer relation to studies of hadronisation for e^e^ and DIS event shapes 
(for a review see e.g. [159j ). one may replace 

A 2 

y -MAifii) . (39) 

TT TT 

Here A{fii) is defined as the integral over a non-perturbative contribution to as, Sag, up 
to some infrared matching scale fij (usually 2 GeV), A{^i) = ^ J^' di^tSas (nt)- Follow- 
ing [149J, it is often written in terms of yet another parameter ao(/i/), the integral of the 
full coupling in the infrared q;o(/^/) = Jq' duagin), via 
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ao (/i/) - as{pt) - ^ (\n— + ^ + 1] a'iipt 



2n \ fii (3o 



(40) 



where K = Ca — ~ f^/) the subtracted terms in eq. (HUi) remove double 
counting with contributions already included in NLO calculations. Fits to data in DIS and 
e~^e~ usually give ao(2 GeV) ^ 0.5. 

The factor M in eq. (ES]) is known as the Milan factor [HUl USD IISSl MM- It 

accounts for the corrections that arise when one considers two non-perturbative "gluons" 
rather than a single one. For all known event shapes, Ai has been calculated to be = 
1.49 — this "universality" of the Milan factor is due to the fact that event-shape observables 
are effectively linear in the momenta of soft gluons |161] . For jets it had been argued 
that only the anti-^t algorithm satisfied this linearity property P3j. This was recently 
confirmed in an explicit calculation by Dasgupta and Delenda jl65] . who showed that the 
kt algorithm instead has Aikt = 1-01. This smaller value is consistent with the somewhat 
reduced hadronisation corrections observed for the kt algorithm compared to anti-/ct in 
fig. [HI though a detailed quantitative comparison has not yet been performed. Future 
calculations of the Milan factor for C/A and SISCone will hopefully also fit in with the 
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Figure 14: Modification of the pt of jets due to the underlying event (upper curves in each 
plot) and hadronisation (lower curves), for qq — )• qq scattering at the Tevatron Run II {pp, 
y/s = 1.96 TeV), comparing Pythia 6.412 [lUO] (tune A, dashed lines) and Herwig 6.510 [TOT] 
with Jimmy 4.3 [158 ] (solid lines). In the case of hadronisation, the Monte Carlo outputs are 
compared to the 1/R part of the analytical result, eq. p8]l (dot-dashed lines). Dijet events are 
selected containing an underlying qq — )• qq scattering, and with the requirement that at parton- 
shower level the hardest jet has 55 GeV < < 70 GeV. The non-perturbative corrections shown 
correspond to the average for the two hardest jets. Taken from |132j . 
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pattern of slight differences that are observed in fig. [H] with respect to the algorithm- 
independent behaviour that is given by eqs. fl37c|38p . 

A point emphasised in |157] is that even if the non-perturbative modification of a jet's 
Pt is rather modest, (9(1 GeV), it can nevertheless have a significant impact on steeply 
falling cross sections. Given a jet pt spectrum that falls as p^", the full result for the jet 
spectrum can be expressed in terms of the perturbative spectrum and the non-perturbative 
shift as 

-iPt) -j—iPt - {Spthp) - -i-iPt) ■ U - — ■ (41) 



dpt dpt dpt V Pt 

Thus for typical values of n in an inclusive-jet spectrum, a 2% change in pt can lead to a 
10 — 15% change in the cross section (this observation holds also for pt shifts due to the 
underlying event, which are discussed below). These are the order of magnitudes often 
seen in experiments' Monte Carlo studies of hadronisation, whose results also cast light 
on the i?-dependence of non-perturbative effects and on the differences between jet 
algorithms [211 [ISSl El ES]. 

A final point is that the above methods can also be used to calculate the non-perturbative 
corrections to the squared jet mass, 

9C AC 
{5M')^pc^^Apt{R + 0{R''))=^MA{iii)pt{R+0{R')) , (42) 

TT TT 

where the R^ terms have small coefficients |132] . Note that for jet algorithms other than 
anti-kt, the Milan factors for {6pt)T<sp and (5M^)np will not be the same. 

4.4 UE, pileup, jet areas 

While the process of hadronisation may well be reasonably universal between e+e^, DIS 
and pp collisions, the latter have the additional feature of the "underlying event" (UE), 
which can be thought of as the semi- or non-perturbative interactions that occur between 
hadron remnants in a pp collision. Our understanding of the UE is somewhat less developed 
than that of hadronisation. One way that one can model it is by saying that it induces an 
extra amount of transverse momentum per unit rapidity, A[/£;o In this case a jet should 
receive a position contribution to its pt from the UE that is proportional to the region of 
the rapidity-azimuth region that it covers, i.e. ~ R^: 

{6pt)vE ^ AueRMR) =Aue(^-^ + ..) . (43) 



where terms at i?^ and beyond |132j hold for the a 4-vector (E) recombination scheme. 
The corresponding formula for the change to the squared jet mass is 

{Mf^)„, A„,p, (^ + ^ + . . .] . (44) 



^^Later we will talk of transverse momentum p per unit area on the rapidity-azimuth plane; Ajje — 

2-KpuE- 
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Figure 15: Similar to Fig. but for just one algorithm, at the LHC {pp, ^/s = 14 TeV) rather 
than the Tevatron, and for gg — )• gg collisions (rather than qq — )• qq). Taken from |132j . 

In fig. [m the upper curves represent the UE contributions to the pt of Tevatron jets, 
as determined from the UE models in Pythia fTOOj (tune A [52]) and Jimmy [158] (with 
the ATLAS tune [52]). Three features are worth commenting on: (a) the curves agree 
with a rough i?^ dependence, (b) the two models disagree by a factor of two even though 
they have both been tuned to Tevatron data and (c) the value that one extracts for Aue? 
in the range 2 — 4 GeV, is quite a bit larger than the pt per unit rapidity that would be 
generated by normal hadronisation for a quark or gluon dipole stretched between the two 
beams (respectively 0.5 GeV and 1 GeV). 

For the LHC, the models predict an even larger contribution from the UE, cf. fig. [15] 
from which one deduces Aue ~ 10 GeV, and for a large range of R it dominates over 
hadronisation. Furthermore pileup (multiple pp collisions in a given beam crossing) is 
expected to add up to an extra 100 GeV of soft "junk" per unit rapidity. 

All of this implies that it is important to understand in more detail how jets are affected 
by "low-p(" noise that is roughly uniformly distributed in rapidity. Two things can happen: 
firstly, the soft junk can end up in the jet — to study this it is useful to refer to the "jet 
area" [37|, a measure of the extent of the region in rapidity and azimuth over which a jet 
captures UE or pileup; secondly, the presence of the UE (and pileup) can modify the the 
way non-UE particles get clustered into jets, a process named back-reaction in |37j . 

4.4.1 Jet areas 

A jet's "area" is a way of measuring its susceptibility to contamination from soft radiation. 
Two definitions were proposed for it in [37] and the results quoted here are all taken from 
there. The passive area is a measure of the jet's susceptibility to pointlike radiation. One 
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introduces a "ghost" particle, g{y, 0) with infinitely low transverse momentum and situated 
at some rapidity and azimuth 0, and then defines the area for jet J in terms of the region 
in over which the ghost is clustered with the jet: 

a{J)^ [dyd<f>f{g{y,<P),J) /(^, J) = / ^ ^ . (45) 



g^J 

For an infrared safe algorithm J itself is of course unaffected by the addition of the infinitely 
soft particle g. If J consists of a single (hard) particle then its passive area is nR^ for all 
algorithms. 

An alternative definition of area is the active area, which measures a jet's susceptibil- 
ity to diffuse radiation. Here one imagines a large number of very soft ghost particles, 
uniformly distributed in rapidity and azimuth (with some optional randomness). One can 
define a jet's active area for a given ensemble {gi} of ghost particles, 

AiJ I {g^}) = ^ . (46) 

where Mgi^J) is the number that end up in the jet and z/g their number density in 0. One 
then often considers the average active area, an average over many ghost ensembles 

A{J)= hm {A{J\{g;}))^, (47) 



taken in the limit of many ghost particles (with fixed infinitesimal total pt per unit area)o 
A key difference between the passive and active areas, is that in the latter, with many 
ghosts, the ghosts can cluster not only with the real event particles but also with other 
ghosts and this modifies the result for the area. 

Given that one usually imagines the UE and pileup as being fairly diffuse (and UE/pileup 
particles can cluster between themselves), it is the active area that is probably the most 
natural measure of sensitivity to the UE or pileup. However, there are two reasons why it 
is useful to consider both passive and active areas. Firstly, the UE is actually somewhere 
in between diffuse and pointlike and a full understanding of UE contamination benefits 
from considering both limits. Secondly, of the two, the passive area is often simpler to 
treat analytically. 

Let's illustrate these points for the case of a jet that has just one, hard particle (IP J). 
As mentioned above, the passive area is nB?. This statement holds for all jet algorithms. 
The average active area and its standard deviation over ghost ensembles are given in table HI 
For one algorithm, anti-/ct, the active area is identical to the passive area and A{J \ {gi}) is 
independent of the particular ghost ensemble. This can be seen as advantageous, insofar 



These two areas are strictly speaking both "scalar" areas. Passive and active areas also come in 
"4-vector" variants, which take into account the ghosts' full impact on a jet's 4-vector. Though useful for 
subtracting pileup and other noise, most of their features are quite similar to those of scalar areas, so we 
shall not discuss them separately in this section. 
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Algorithm 


kt 


C/A 


SISCone 


anti-A;^ 


^(IPJ) ±S(1PJ) 


0.812 ±0.277 


0.814 ±0.261 


i±o 


1 ±0 


A(GJ) ±S(GJ) 


0.554 ±0.174 


0.551 ±0.176 







Table 4: The average active area A(IPJ), eq. ()47p . for an isolated one-particle jet in various 
jet algorithms and its standard deviation S(IPJ) over ghost ensembles. Results are also given 
for the area of jets that are purely composed of ghosts (GJ), in the cases where this makes 
sense (in SISCone the result depends critically on the distribution of ghosts, while for anti-kt the 
distribution of ghost-jet areas has two peaks, one at 0, the other at ttB?). 

as it implies that the contamination of an anti-fcj jet will be independent of the detailed 
structure of the UE or pileup. 

SISCone also has the property that its active area is independent of the ghost ensemble, 
but the actual value is much smaller than for the passive area (a consequence of the split- 
merge step which eats away from the main jet). This is good insofar as it means less overall 
sensitivity to noise, but bad because the exact amount of contamination will depend on 
the details of just how pointlike the UE or pileup is (a feature that therefore needs to be 
well tuned in Monte Carlo programs). 

The kt and C/A algorithms are unlike the other two in that A{J \ {gi}) depends signifi- 
cantly on the exact set of ghosts, as indicated by the standard deviations in table IH which 
are about one third of the average area. The non-zero standard deviation arises because 
ghosts tend to cluster between themselves before clustering with the hard particle, and 
slight shifts in the layout of the ghosts lead to significant differences in the final clustering. 
This implies an extra source of fluctuations from UE and pileup contamination: one has 
not only the intrinsic fluctuations in the amount of pt in a given event's UE/pileup, but 
also a fluctuation in how much of the UE/pileup the jet actually captures. The consequent 
(moderate) worsening of the kinematic resolution of the jets seems to be an inevitable 
feature of jet algorithms with a QCD-motivated hierarchical clustering sequence: the al- 
gorithm is trying to assign meaningful QCD substructure to the jet, and the absence of 
such substructure in the UE/pileup induces a degree of randomness in the outcome of the 
clustering (this is related also to the irregularity of the jet boundaries in flg. [7]). 

The above results hold for 1-particle jets (IP J). Real jets are more complex because 
QCD branching gives them substructure. To a flrst approximation, one can examine what 
happens if one adds a single soft gluon at an angle 6 with respect to the jet axis. This gives 
a modifled jet area, e.g. ajA,R(6') in the passive case, illustrated in flg. [161 After integration 
over the (soft, coUinear) QCD branching matrix element one obtains 

(«jA,ii) = vri?^ ± rfjA,^ ^ In ^^l§^ , (48) 

where the anomalous dimension (i.e. the pt-dependent rightmost term) stems from the 
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Figure 16: Schematic representation of the passive area of a jet containing one hard particle 
"1" and a softer one "2", ajA,_R(^i2)) for various separations between them and for the usual 4 
jet algorithms. Different shadings represent distinct jets. Figure adapted from 





a(lPJ) ^(IPJ) 


cj(lPJ) S(IPJ) 


d D 


s S 


h 


1 0.81 


0.28 


0.56 0.52 


0.45 0.41 


Cam/ Aachen 


1 0.81 


0.26 


0.08 0.08 


0.24 0.19 


SISCone 


1 1/4 





-0.06 0.12 


0.09 0.07 


anti-A;^ 


1 1 












Table 5: A summary of main area results for our four jet algorithms: the passive (a) and active 
{A) areas for 1-particle jets (IPJ), the magnitude of the passive/active area fluctuations (a, S), 
followed by the coefficients of the respective anomalous dimensions (d, D; s, S), in the presence 
of perturbative QCD radiation. All results are normalised to vrii^, and rounded to two decimal 
figures. For algorithms other than anti-fcj, active-area results hold only in the small- i? limit, 
though finite- i? corrections are small. 



integration over the energy of the soft gluon, and its coefficient is given by 

djA,R = y (ajA,i?W - vri?2) . (49) 

In eq. ( HSj) bo = ^^'~^i2t!'^^ ^^"^ colour factor of the hard particle in the jet {Cp 

or Ca)- The scale Qq is a non-perturbative cutoff scale, introduced by hand, and which is 
necessary because jet areas are not IR safe (except in the case of anti-fcj). The physically 
natural value for Qq will depend on the characteristics of the UE/pileup: given an amount 
of transverse momentum p per unit area, one expects Qq to be O {pB?)rA 

Formulae analogous to eqs. fl48|49l) hold for the active area (vri?^ — )■ y4(lPJ), d — )■ D), 

^^That is: a transverse momentum with respect to the beam O (^pR^), which translates to a transverse 
momentum with respect to the jet O (pR^) , it being the latter that corresponds to Qq in eqs. (l48l) etc. 
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Figure 17: Average active area and standard deviation (solid lines and band) in simulated 
Herwig 6.5 events (default UE) compared to analytical expectations (dashed lines) with a fitted 
Qo value; shown separately for 4 algorithms and for qq — )• qq and gg — )• gg events. Adapted 
from [371 E3]. 
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and also for the standard deviations ctja,/? of the passive area, 





-j{a^A,R{d)-'^R^? (50) 



and the active area 



—— m — . 

nbo as{Rpti) 



(51) 



The various coefficients are all summarised in table |5l while in fig. [T7] the resulting pre- 
dictions are compared to jet areas measured in Herwig Monte Carlo simulations (with Qq 
fitted on a case-by-case basis). One sees how kf has a fairly large area, large fiuctuations 
and strong pt dependence, C/A an area ~ nR"^ with moderate fiuctuations and little pt 
dependence, SISCone an area smaller than ttR^ (but still larger than 7ri?^/4), small fiuc- 
tuations and moderate pt dependence and, finally, anti-/ct an area very close to nR^, with 
almost no fiuctuations or pt dependence. A corollary of the kt algorithm's strong pt de- 
pendence is that is that if one increases the UE/pileup density p, the jet area will shrink, 
a consequence of the presence of as{Qo) in eq. (HSj) . with Qq ~ pR^. 

A final comment concerns the relation between passive and active areas. They differ in 
sparse events because there is an ambiguity in how one assigns "empty" parts of the event 
to the jets — the two kinds of area simply consist of different prescriptions for doing this. 
In very dense events, where the jet boundaries are well delimited by the event's particles, 
the passive and active areas become identical (as should any other sensible definition of 
area) . 

4.4.2 Back reaction 

Suppose you have an event with particles numbered 1 — 100, and in which numbers 1 — 10 end 
up in jet a. Now immerse that event in a soft background with finite pt- The background 
will add its own particles to the jet, but it can also alter the behaviour of the clustering 
with respect to the original particles. So maybe only particles 1 — 9 would end up in jet a, 
or maybe jet a would additionally contain particle 11. This is back reaction. 

The study of back reaction bears similarities to that of jet areas: in particular one 
can study it with pointlike noise, or with diffuse noise. The former can be dealt with 
analytically, whereas the latter is tractable only numerically. 

Full details are given in |27|, but the basic analytical result is that the average net 
change in a jet's pt due to back reaction in the presence of diffuse noise has an asymptotic 
behaviour of the form 



where the coefficients are Bkt,R ~ Bc/a,r - -O.lOvri?^ and -Bsiscone.R = -Banti-fct.R = 0. 




(52) 



In practice, given the small size of the B coefficients (and the fact that the ^ In 
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Figure 18: The distribution of back reaction for high-p^ jets (pt > 1 TeV) immersed in pileup 
corresponding to high-luminosity LHC running (p ~ 15 GeV per unit area). Simulated with 
Pythia 6.4 and shown for 4 algorithms. 



factor is often O (1)), the term of order a^p is usually as important as the formally leading 
term. Both terms are generally small compared to the direct contamination of the jet from 
UE/pileup noise, O (p ■ vri?^). 

The concrete situation for the various algorithms is illustrated in fig. [TS], which shows 
the distribution of back reaction for a high-pt jet immersed in pileup (p ~ 15 GeV). In 
about 1% of events one has a back reaction of order of p, except for anti-/ct, whose back 
reaction is far more suppressed. Fig. [18] confirms that back reaction is a modest effect 
compared to the direct contamination of a jet from background noise. Essentially it is 
relevant only when trying to determine a jet's energy to very high precision, or in the 
presence of extreme noise (as in heavy- ion collisions). 



4.5 Summary 

We have seen a number of results here. Let us summarise them: 

• Most jet algorithms will cluster a pair of particles if they are within R of each other; 
SISCone reaches out to 2R (somewhat less in real events) if the two particles are of 
similar hardness. 

• At small R, a jet's pt is reduced relative to a parton's by an amount ~ a^ptln 1/R. 
With R = 0.4, that's of order 5% for a quark, 10% for a gluon. The mean squared 
jet mass goes as asR^pl- 
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• Hadronisation reduces a quark jet's pt by an amount ~ 0.5 GeV/R at small R 
(roughly double this for a gluon jet), with modest differences between algorithms. 

• The underlying event and pileup induce contaminations proportional to ~ i?^. Be- 
cause the intrinsic energy scale associated with the underlying event {pt per unit 
rapidity of 10 — 15 GeV at the LHC) is an order of magnitude larger than that from 
hadronisation (and pileup is yet another order of magnitude larger), one should de- 
vote special effort to understanding different jet algorithms' susceptibility to them. 
This can be done via the concept of jet areas. The kt algorithm has the largest jet 
area (with noticeable pt dependence and fluctuations), SISCone the smallest, and 
anti-fct has the most stable jet area, nearly always irR"^. 

• The UE (and pileup) modifies how an event's original particles get clustered into jets 
— this is back reaction. Its impact is an order of magnitude smaller than the direct 
contamination, but can be relevant for precision studies. It is essentially zero for the 
anti-fct algorithm. 

The differing impact of various physical effects across algorithms and as a function of R 
might seem like a source of considerable complication in jet finding, and in some ways 
it is. However, it can also be used to our advantage. One example is in studies like 
top-mass measurements that are in part limited by physics-modelling systematics. If one 
uses multiple jet definitions with different sensitivities to UE/pileup, gluon radiation and 
hadronisation, and then finds that the final Monte Carlo-corrected top mass is independent 
of the choice of jet definition, then this provides a powerful cross check of the physics 
modelling within the Monte Carlo generator. And, in the next section, we shall see how 
our understanding of jets can help guide the choice of "optimal" jet definitions for various 
reconstruction tasks. 

5 Using jets 

So far at hadron colliders, jets have mostly been used as fixed objects — universal, if 
imperfect proxies for partons. Generally, experiments have settled on one or two main jet 
definitions for nearly all their analyses: for example at CDF the Midpoint algorithms with 
R = 0.7 for most inclusive-jet studies]^ and JetClu with R = 0.4 for top-quark physics 
and for searches. 

Such a strategy was probably not too far from optimal at the Tevatron, where most of 
the physics being looked at is in a modest range of scales, from a few tens of GeV to a few 
hundred, and pileup is present, but not overwhelming. 

The LHC, in contrast, will cover a broader range of scales from a few tens of GeV to 
a few TeV, events with multiple simultaneous scales will be common (e.g. EW bosons and 

^® Though in recent years they also studied the kt algorithm with three R values [IJ[73], and this probably 
played a key role in convincing the LHC experiments that the kt algorithm is viable. 
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top quarks with pt ^ m) and pileup will range from almost none to 20 — 30 simultaneous 
pp interactions in each bunch crossing. This begs the question: could analyses benefit from 
more flexible jet finding? 

The work examined below tries to examine this question by concentrating on two char- 
acteristic types of analysis — standard mass reconstructions, with attention also to the 
issue of pileup; and the task of identifying highly boosted massive particles. Our discussion 
will be restricted to studies at "particle level" (also referred to as hadron level) and won't 
go into detector-specific effects. For a discussion of the latter, see for example |167[I120[ [TB]. 

5.1 Choosing an algorithm and a radius 

Which jet algorithm is "best"? This is a widespread question, and a natural follow-on 
question is "which R should one use"? This question cannot be answered in isolation. It 
inevitably goes with the issue of what one wants to use the jet algorithm for. 

The most reliable way of answering the question is to carry out a detailed study of the 
process one is interested in, with many jet algorithms, and many R values for each. Then 
one may devise some "quality measure" and establish which algorithm optimises it. This 
can be a big job (4 algorithms, maybe 10 R values) and is seldom done in a systematic 
way. In what follows we'll see how even crude analytical estimates can give guidance on 
the question, and then examine some Monte Carlo studies. 

5.1.1 Analytical study 

In a QCD measurement like that of the inclusive jet spectrum, one will compare data to 
a perturbative QCD prediction. At moderate pt, one of the largest ambiguities in the 
comparison comes from non-perturbative effects (since perturbative effects are calculable 
to some accuracy), often comparable with PDF and experimental uncertainties. Therefore 
one might want to choose an algorithm and R value that minimise hadronisation and 
UE contributions. One can choose to ignore the relatively modest differences between 
algorithms, and just take the analytic formulae of sections 1^^14.41 From these, one deduces 
a value of R that minimises the squared sum of hadronisation and UE pieces |132] . 



where only the leading R terms have been used, Ci is the appropriate colour factor {Cp, Ca) 
for quark/gluon jets, and we have assumed a jet area of ttR"^ for simplicity The resulting 
numerical R values are given in table 16)1^ 

^^In practice, an additional issue is that perturbative uncertainties from missing higher-order contribu- 
tion may also depend on R. The interplay between this and non-perturbative uncertainties has not been 
studied. 
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quark jets 


gluon jets 


Tevatron 


0.56 


0.73 


LHC (14 TeV) 


0.41 


0.54 



Table 6: R values that minimise the two non-perturbative contributions in various circumstances 
for Tevatron and LHC running, based on eq. ()53p . with 2Ai A{fii)/'TT = 0.19 GeV and Aue = 
4 GeV (10 GeV) at the Tevatron (LHC). 
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Figure 19: Left: sum of the squares of the mean shifts of a jet's momentum due to perturbative 
gluon radiation, hadronisation and the UE, as a function of R for pt ~ 50 GeV quark jets at the 
Tevatron; right: the resulting crude estimate for the "best" R as a function of jet pt, for quark 
and gluon jets at the Tevatron and LHC (14 TeV). These values are to be taken as indicative of 
general trends rather than reliable estimates of the best R. The plots use the same parameters 
as tableland the perturbative contribution is taken in the small- limit. Taken from [132j . 

If one uses jets for kinematic reconstruction, the considerations are different: when 
trying to identify a mass peak, for example, it is of little consolation that one can calcu- 
late the perturbative degradation of the peak if that degradation in any case causes the 
peak to disappear under the background. A very crude estimate of what goes on can be 
had by assuming that fluctuations in a jet's momentum due to perturbative radiation, 
hadronisation and UE are each proportional to their average effect. Adding the squared 
averages in quadrature gives fig. [12] (left) and the minimum provides an idea of the optimal 
R (as before, ignore differences between algorithms), and illustrates how the main relevant 
interplay is between perturbative radiation and the UE. The right-hand plot shows how 
the resulting optimal R varies with pf. gluon jets and high pt jets prefer larger R values 
(because of the greater relative importance of perturbative radiation), while one needs 
smaller R values at the LHC than at the Tevatron (the former has more UE). 

While fig. [19] is useful for understanding general trends (notably the need for large R 
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at high pt), it is not quantitatively rehable: the fluctuations in a jet's kinematics and the 
mean energy-loss due to gluon radiation are for example not proportional to each other; 
also, for i? ~ 1, the Ospjlni? approximation for perturbative energy loss is in itself poor 
since it neglects terms of O [asPt)', finally, for simplicity, it has been obtained neglecting 
differences between algorithms and this is not entirely legitimate. 

5.1.2 Numerical studies 

Given that a complete analytic treatment is not yet available, one can use Monte Carlo 
event simulation to examine the optimisation of the choice of jet definition. 

Historically the approach taken to studying the quality of jet definitions has been to take 
hard partons in a Monte Carlo, let them shower and hadronise, and then see how closely 
the reconstructed hadron-level jets match the original partons (see for example |168] ). This 
procedure has the drawback that it is conceptually impossible to extend it to advanced 
Monte Carlo tools like MC@NLO [49j, because the "original parton" is no longer iden- 
tifiable. Furthermore it leads to several distinct quality measures: whether the number 
of jets is equal to the number of "partons" , the angular distance between the jets and 
the partons, and the pt difference between the jets and the partons. It is then often not 
clear which of these measures is most representative of the algorithm's usefulness in a real 
physics analysis. 

The most robust way of proceeding would be, for each possible experimental study, to 
carry out a full signal and background analysis with a wide variety of jet definitions and 
then see which provides the best signal to background (or root-background) ratio. As well 
as being a major undertaking, this can have subtleties: for example, optimal cuts in an 
analysis may depend on the jet definition and so may need to be reoptimised for each new 
jet definition. 

An approach taken in |169[ I170[ ISTj IM] attempts to carry out a simplified version of 
this procedure. It takes a physical process, for example the production of a narrow Z' that 
decays to qq, or a tt event (t — )■ hadrons) and attempts to reconstruct the massive object. 
The "better" the reconstructed mass peak, the better the jet algorithm. This ignores issues 
like how the jet algorithm performs with respect to background events (which was however 
additionally studied in [H]), but is well-defined (no reference to partons) and avoids the 
appearance of multiple quality measures (angular dispersion, energy dispersion, etc.). Note: 
the physical process itself need not necessarily be realistic — e.g. it is highly unlikely that 
there exists an as-yet undiscovered, hadronically decaying Z' with mass 100 GeV. But it 
still serves as a useful stand-in for a generic qq resonance, whose mass scale can easily be 
varied. 

Quantifying the quality of the mass peak is, as it turns out, a complex issue. Two 
properties that matter are the position of the mass peak (how close it is to the mass scale 
of the particle being reconstructed) and its width. The position matters, for example, when 
trying to accurately reconstruct the mass of a hadronically decaying resonance. The width 
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matters in terms of being able to identify clear signal peaks over a smooth background. In 
this section we will concentrate of the width. That the width is not trivial to determine 
is evident from fig. |20l It is tempting to measure the peak quality by fitting a Gaussian. 
However, the fit is poor; the results of the fit depend on the choice of fit-window; and then 
it's not clear which parameters of the Gaussian would serve as the quality measure: the 
normalisation? The width? Some combination of the two? The approach of 1169^170] 157] is 
to avoid the fit-function and instead to find the width of the smallest window that contains 
a specified fraction z of the events, Qy=z- sharper (i.e. better) peak corresponds to a 
lower QJ=z value. There is still some arbitrariness in this, for example the choice of the 
fraction of events z (defined with respect to all events before cuts). However z is easily 
varied to check the robustness of the procedure and one can also examine other quality 
measures! 
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Fig.[2T]illustrates the procedure for a 100 GeV 
qq resonance and a 2 TeV gg resonance, exam- 
ining three jet definitions (the use of z = 0.12 
corresponds to taking about 25% of events af- 
ter cuts)Hll One sees how better peaks have 
lower values for Q^=zi together with the extent 
of the differences between algorithms, and the 
relevance of the choice of R. 
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Figure 20: Distribution of the recon- 
structed invariant mass of a 100 GeV qq 
system for the C/A algorithm with R = 0.3, 
simulated with Pythia 6.4, together with a 
Gaussian fit. 



The full /^-dependence of the quality mea- 
sure is shown for 5 algorithms in fig. for 
the same two physics cases. The minimum of 
each curve indicates the best R for that particu- 
lar algorithm, while the differences between the 
curves are illustrative of the different behaviour 
of the various algorithms. In particular, one 
sees the preference for larger R in the 2 TeV gg 
case. This is in accord with the expectations discussed in section 15.1.11 the larger the 
importance of perturbative gluon radiation, the larger the preferred R value. The optimal 
i? as a function of mass scale, for the different algorithms and for the qq and gg cases, is 
illustrated in fig. [221 The overall trend is not unlike the rough analytical estimate, fig. [12] 
(right), but the details differ: for example the full result doesn't show as rapid an R de- 
pendence, and it is not clear to which extent the optimal R saturates at the highest scales 
in fig. M 

Figure [22] shows that even at the optimal R there are differences between algorithms. 
The origin of these differences has not been analysed in full detail, but can almost certainly 
be traced back to the very different area properties of the various algorithms: kt fares worst 
because its larger area allows more UE into the jet, causing enhanced fluctuations of the 
kinematics from event to event; meanwhile SISCone, with its small area, fares well, as 
does the filtered version of the C/A algorithm, which resolves each jet on an angular 
scale R/2 and takes just the two hardest subjets (cf. section [2.2.6p . Ref. [86] has shown 



■^"One alternative is to fix the window width to be x GeV, place the window so as to maximise the 

1 / f 

number of events that it contains, and then use tffii3 inverse fraction of contained events Qw=x as the 
quality measure. A better peak concentrates more events in a given window, giving a lower result for 

^w—x • 

^^This and other figures in this subsection are all taken from [87], to which the reader is referred for full 
details. In the dijet case, a mass is reconstructed from the two hardest jets, with a cut |Ay| < 1 on the 
rapidity interval between the two jets (because in studies with background, such a cut greatly reduces the 
background) . 
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Figure 21: Illustrative dijet invariant mass distributions for two processes (above: qq case at 
M = 100 GeV; below: gg case at M = 2 TeV), comparing three jet definitions for each process. 
The shaded bands indicate the regions used when obtaining the quality measure. Note that 
different values of R have been used for the qq and gg cases. 



that additional modifications of the filtering procedure ("trimming") and tuning of its 
parameters can yield yet further improvements, as does noise "subtraction", discussed in 
section 15.21 

A key question in examining results such as those in fig. and [22] is how much the 
differences between algorithms (and -R-values) matter: from this information an experi- 
ment can then decide whether it is worthwhile investing in the calibration of multiple jet 
definitions (or in the inherent flexibility that allows easy use of any jet definition). One 
way of making such an estimate is to assume that the peak is being reconstructed in the 
context of a search with significant background. If one can assume that the amount of 
background basically scales as the width of the window that comes out of the Q^=z quality 
measure, then the significance S/ \fB of any signal will just be inversely proportional to 
One can then define a measure pl, which is the extra factor in luminosity that is 
needed to see the signal with given significance when using one jet definition (JD) relative 
to another. 

Results for pi are given in fig. |2H One sees that the impact of the jet algorithm choice 
is greatest at large mass scales (not surprising perhaps, given that large scales prefer large 
i?, where the area-sensitivity matters particularly). That figure also illustrates how, at 
large mass scales, especially for gluon jets (as dicussed first in |170j ). standard choices of 
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-R ~ 0.5 are extremely poor — requiring up to twice as much luminosity to see a mass peak 
above a background. This conclusion is relatively robust: fig. actually has results for two 
different quality measures, which agree remarkably well (the solid line derives from Q/=z, 
discussed above, while the dashed line stems from an alternative measure, cf. footnote [30|, 
or [87j for full details). 

A reader who wishes to examine these quality measures further is encouraged to consult 
a web-tool [171] . which provides access to over 100,000 plots, two different quality measures 
and z values, with histograms for a wide range of jet definitions and mass scales as well as 
summary plots of the quality measures and resulting values. 

Three comments are due concerning the above discussion. Firstly, it applies directly 
only to simple dijet events. There have also been studies with multijet events from top- 
quark decays, cf. the two bottom rows of fig. [24l as well as an extensive analysis in |172] . 
What emerges from these studies is that the best choices in dijet events (where SISCone 
works very well) may not be optimal in multijet cases. For example, in |172] SISCone 
had more difficulty resolving all relevant jets, while in fig. |2l] the acceptable range of R 
is somewhat narrower for SISCone than for other algorithms. A related point is that in 
multijet events the conclusion about the need for larger R at high scales is likely to confiict 
with the need to resolve the multiple jets. These are important issues and call for further 
study. 

A second comment is that refs. |169[ I170[ 187] did not carry out any detailed tests of 
how the presence of realistic background events affects the relation between mass scale and 
optimal R. However, the analysis to be described below, ref. [83], did include a study with 
background events, and confirms that at high masses large- i? values are preferred (though 
the cuts and other details differ somewhat from those in j87]). 

The final comment concerns the relation between the results here and those in table O 
Here we have seen that for high masses, large R values are preferred, whereas table E] 
showed that in order to minimise non-perturbative corrections one should use smaller R 
values. These results are not in contradiction. In the case of table El we had in mind 
observables such as the inclusive jet spectrum, where the effects of perturbative radiation 
just shift the distribution slightly, in a way that is accounted for within the NLO QCD 
predictions to which one compares experimental data (as long as R is not so small that 
the perturbative expansion breaks down). Therefore our aim was simply to minimise 
the poorly controlled non-perturbative contributions, leading to a preference for small R 
values. In the discussion in this section we considered the reconstruction of a hadronically 
decaying resonance. Perturbative radiation loss deforms the peak making it less visible 
above a smooth background. Though we could calculate that deformation pertubatively, 
our ultimate aim is to make the peak more visible and so a larger R value, which retains 
more perturbative radiation, becomes preferable. 

Variable R. One should be aware that there may also be benefits to be had by moving 
away from the use of a single R value even when studying a single mass scale. For example. 
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Algorithm 500 GeV 


1 TeV 




2 TeV 




3 TeV 


anti-A;^ anti-fc^ VR 18% (0.9, 200) 
C/A ^ C/A VR 17% (0.9, 175) 


14% (1.0, 450) 
14% (1.0, 400) 


10% 
7% 


(1.2, 1000) 
(1.2, 1000) 


8% 
9% 


(1.3, 1500) 
(1.3, 1500) 



Table 7: Percentage improvement in the number of events from a resonance X that have been 
reconstructed in the mass window mx ± 25 GeV, comparing a fixed- i? algorithm at its best R 
(first number in brackets) with the variable- algorithm (the second number in brackets, p/ GeV, 
sets the jet radius as R{pt) = p/pt)- Results taken from [8^ . 



one might choose to adapt R according to the amount of noise (UE, pileup) in each given 
event. Alternatively, other kinematic variables, like the total transverse energy in the 
event [123] or the rapidity separation between the leading jets, can also be correlated with 
the optimal R choice on an event-by-event basis. 

This last point was studied recently by Krohn, Thaler and Wang (KRT) [HI] with a 
variable- i? jet algorithm (cf. section l2.2.6p . Their R value is actually not directly a function 
of the rapidity difference. Ay, between the two hardest jets, but rather scales as l/ptjet, 
which for two hard jets stemming from a resonance of given fixed mass translates to a 
rapidity dependence R ~ cosh^. This was motivated on the grounds that jets from a 
resonance decay emit gluons on an angular scale that is independent of whether the jets 
are transverse or along the beam direction; in the centre-of-mass frame of the resonance, 
for two partons at rapidity y/2, separated by an angle dij <^ 1, the boost-invariant angular 
distance ARij is given by 9ij cosh |. Hence the scaling used for the jet radiusEl 

Table [7] illustrates the improvements in signal reconstruction that are obtained with 
this approach as compared to fixed- i? algorithms. The benefit is at the level of tens of 
percent. This is similar in magnitude to the improvement seen above by optimising the 
choice of algorithm, or optimising a fixed R as compared to a standard R = 0.5 or R = 0.7 
choice. 

The KRT analysis was performed using the two leading jets reconstructed from all 
particles with |?7| < 3. This means that significant numbers of events involve two leading 
jets with large rapidity separations. In this respect, the KRT analysis differs from that 
discussed above [i87j, which only studied events in which the leading jet pair was separated 
by I Ay I < 1. With the latter requirement, since cosh ^ would be close to 1, one might 
expect the pf-dependent variable- i? choice to have a more modest impact. 

KRT also studied jet performance for resonance reconstruction that includes a dijet 
background. Here too they found improvements with a variable R choice (and again at the 
level of about 10 — 15%), but only if they supplemented their analysis with a "jet quality" 
cut which requires that the energy be deposited centrally within the jet. The corresponding 
fixed- analysis confirmed the need for large R values at high mass scales. 

^^The optimal R clioice should of course depend also on initial state radiation and the underlying event, 
and further studies might benefit by taking into account this information too. 
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5.2 Pileup subtraction 



The LHC will collide protons with an unprecedented instantaneous luminosity of up to 
10^^ cm~^ s~^ and a bunch spacing of 25 ns, corresponding to 0.25 mb~^ per bunch 
crossing. While this high luminosity is essential for many searches of rare new physics 
processes at high energy scales, it also complicates analyses, because at each bunch crossing 
there will be of the order of 20 minimum bias pp interactions, which pollute any interesting 
hard events with many soft particles. The beams at LHC will have a longitudinal spread, 
and it may be possible experimentally to associate each charged particle with a distinct 
primary vertex that corresponds to a single pp interaction and so eliminate some fraction 
of the soft contamination. However, for neutral particles this is not possible, and many jet 
measurements are in any case expected to be carried out with calorimeter-cell or cluster 
information, for which there is not sufficient angular resolution to reconstruct the original 
primary vertex. Therefore kinematic measurements for jets will be adversely affected by 
pileup (PU), with resolution and absolute energy measurements suffering significantly. 

The impact of PU is illustrated in the upper row of fig. [251 which shows histograms for 
the same 2 TeV gg resonance used above, but now with varying degrees of pileup: none, 
low-luminosity LHC running (0.05 mb~^ per bunch crossing) and high-luminosity running 
(0.25 mb~^ per bunch crossing). The degradation of the peak and its shift to higher masses 
are clearly evident here. While the shift is perhaps not overly consequential (it could be 
corrected for by comparing to MC simulation of the pileup), the loss of resolution is a 
serious issue. 

Both the Tevatron and LHC experiments have examined the question of pileup. Some 
approaches to limiting its impact are based on average correction procedures, for example 
the requirement that final measured distributions should be independent of luminosity [1] , 
or a correction to each jet given by some constant times the number of primary interaction 
vertices (minus one) [175] . These approaches have the advantage of being simple, but their 
averaged nature limits the extent to which they can restore resolution lost through pileup. 
Other approaches involve event-dependent corrections that are applied to calorimeter tow- 
ers either before or during the clustering |176[ 1177] . These can give better restoration of 
resolution than average-based methods. One drawback that they have is that they are 
tightly linked to the specific experimental setup (for example calorimeter cell-size), and 
require ad-hoc transverse-momentum thresholds to distinguish pileup from hard jets. Ad- 
ditionally they are sometimes tied to specific (legacy) jet algorithms, and so may not always 
be readily applied to more modern jet algorithms. 

The above issues triggered the development of an experiment-independent pileup sub- 
traction approach in |174] . The essential observation is that pileup roughly modifies a jet's 
Pt as follows: 



where A is the jet's area (after addition of the pileup), p is the mean amount of transverse 
momentum per unit area that has been added to the event by pileup; a measures the 
fiuctuations of the pileup from point-to-point within the event (defined as the standard 




(54) 
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deviation of the distribution of pileup across many squares of area 1); and Apf is the 
net change in transverse momentum due to back reaction. Fluctuations in the jet pt 
arise because A varies from jet-to-jet, p from event-to-event, because the pileup density 
fluctuates from place to place within an event (the a term) and because of back reaction. 
The approach of |174] involves first running the jet algorithm, estimating p and then 
subtracting the Ap term from each jet. This leaves just the ay/A piece and the back- 
reaction term and should significantly reduce both the fluctuations and the mean offset in 
jet energylffl 

The estimation of p (without using detector-specific information, such as the origin of 
tracks) is non-trivial because one must decide what part of the event belongs to the hard 
event and which part comes from pileup. Ideally one wants to do this without introducing 
any explicit threshold to distinguish the two, given that the natural threshold would vary 
significantly from event to event. One observation of [174] is that this can be done using 
the jets themselves. Fig. fIEk shows a scatter plot of jet pt versus jet area for a single Monte 
Carlo event and one sees a clear correlation from nearly all the jets. Fig. l2Bb shows Pt/A 
for each jet as a function of rapidity for the same event and one sees that the pt/A results 
cluster around a value that is fairly independent of rapidity (it is still unclear just how 
true this will actually be at the LHC). These two features led to the proposal to take the 
distribution of pt/A for all jets in an event, up to some maximum rapidity, and to then 
use its median (robust with respect to outliers, i.e. hard jets) as an estimate of p0 That 
estimate gives the black line in fig. [261 while the band's width is controlled by the value of 
a obtained from examining the width of the pt/A distribution. 

With p estimated in this fashion, one can correct each jet by an amount: 

p^^^p^^-pA^, (55) 

where Aj is the jet's 4-vector area. This approach was used to obtain the lower row of 
fig. [251 illustrating the substantial gain in peak quality that is to be had with the method 
(as well as nearly correct reconstruction of peak position). In this specific case there is 
even an improvement in the peak quality in the case without pileup, a consequence of the 
fact that the above method also subtracts UE. 

One comment is that pileup subtraction does not completely eliminate the effect of 
pileup. In the area-based approach just described, this is because of the last two terms on 
the RHS of eq. (15^ are still present after subtraction. Nevertheless it reduces the impact of 
pileup sufficiently that conclusions about optimal jet definitions drawn from fig. [2l]in the 
absence of pileup also hold with pileup. This is important, because it means that analyses 

^^One might eliminate the back reaction too if one could subtract the PU before running the jet-finder — 
subtracting PU directly at particle level leads, however, to negative momenta, with consequent non-trivial 
issues within many jet-algorithms. 

^^One technical detail is that only certain jet algorithms, essentially kt and C/A, are suitable for esti- 
mating p, while the subtraction itself can be performed for any algorithm. When trying to optimise the 
subtraction there are also issues with the choice of the correct R value in the JD used to estimate p, with 
i? ~ 0.5 — 0.6 being a generally reasonably choice. 
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that use data taken at different luminosities can successfully use a common jet algorithm, 
independently of the pileup. 

Subtraction in heavy-ion collisions. The techniques described above have the poten- 
tial to be useful also in heavy-ion (HI) jet finding, where the problem is to identify jets 
given the large soft background of particles that results from the hot dense matter that 
is formed in a heavy-ion collision. This is of interest in the heavy-ion community (see for 
example the review [23]) because the modification of jets as they traverse the hot dense 
medium may provide insight into the nature of the medium. 

The proposal for area-based pileup subtraction |174j also included an application to 
the HI case and it has been investigated by the STAR collaboration at RHIC in a first 
prehminary measurement of jet-cross sections in Au Au gold collisions at VSnn = 200 GeV 

m- 

The value of p in the heavy-ion case is up to an order of magnitude larger than in 
high-luminosity pp running. This places particularly stringent constraints on the accuracy 
that is required in the subtraction and has spurred various ongoing investigations. Given 
the similarities between HI and high-pileup pp jet finding, it is to be expected that these 
investigations will be beneficial in both environments. 

Among the issues that are being considered is that of how to estimate p without recourse 
to the whole event, given that there is significant rapidity dependence (and even azimuth 
dependence in some cases) in the production of soft particles, both in HI and pp collisions 
(the event in fig. [26b is a little unusual in its degree of rapidity-independence). Other 
issues are those of minimising systematic residual shifts from back reaction (for which the 
anti-fcj algorithm is beneficial) and reducing the impact of point-to-point fluctuations in 
the noise (in this respect C/A filtering seems to offer a promising avenue). 

5.3 Substructure 

A key feature of the LHC is that it will be the first collider to probe scales that are 
significantly above the EW scale. This is what will allow the LHC to investigate the nature 
of electroweak symmetry breaking and explore new territory in the search for particles and 
phenomena beyond those of the standard model. 

The importance of physics at transverse momenta pt ^ mz has implications for the 
structure of the final state because at high transverse momenta, "signature" particles, W's, 
Z's, Higgs bosons and top quarks, have very coUimated decays (due to their relativistic 
boost). Standard approaches for identifying these particles (i.e. recombining different jets) 
fail because all the decay products end up in a single jet. 

The work so far on identifying hadronic decays of boosted heavy particles has fallen 
into two broad classes: particles with two-pronged decays (the EW bosons), and those 
with three-pronged decays (top quarks). In each case, the mass of the jet is one indicator 
of its origin (as discussed recently for example in |178[ I179[ I180[ I18H 1182 ]). However, even 
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for massless partons, QCD branching generates a significant fraction of jets with large 
masses (or equivalently with 2 or 3-pronged substructure): assuming a given jet pt, the 
leading-order (fixed-coupling) differential QCD jet-mass distribution goes as 



(see jl33j for more detailed analytic expressions, or |134[ 1136] for corresponding resummed 
results in e+e~collisions) and the logarithm can in part compensate the smallness of as, 
especially at larger pf. Two main questions that need to be answered are then: how can 
one reduce the background of QCD jets of a given mass, and how can one get the best 
resolution on jet mass so as to be able to use a small jet-mass window in selecting candidate 
heavy particles? 

5.3.1 Two-pronged decays 

The first detailed discussion of advanced jet techniques for two-pronged decays, over 15 
years ago, was given by Seymour in |183] in the context of a search for a heavy Higgs 
boson decaying to WW with one W decaying leptonically, the other hadronically. He 
mainly considered the issue of mass resolution and investigated two approaches. One 
method involved the (inclusive) kf algorithm, with i? = 1, in which the clustering sequence 
for the hardest jet was essentially undone by one step, so as to resolve the jet into the two 
subjets from the W decay. The resulting separation of the subjets could then be used to set 
a smaller R for a second run of the kt algorithm, which helped improve the mass resolution. 
Another method involved the use of a cone algorithm with quite small R, ~ 0.25 in order to 
directly identify the two subjets. This small R was needed in order to robustly resolve the 
two subjets, but that then caused it to lose significant gluon radiation from the W qq 
system, giving worse mass resolution than the kf algorithm. The basic observation was 
therefore that the kt algorithm's intrinsic internal information on substructure allowed one 
to be more flexible in the compromise between identifying substructure and capturing the 
bulk of the relevant radiation. 

The next development on the subject was made by Butterworth, Cox and Forshaw |184] 
who examined WW scattering, again with one leptonically and one hadronically decaying 
W. They observed that the distribution of kt distance, dij (eq. (|H])), between the two W 
subjets was close to the W mass in W decays, but tended to have lower values in generic 
massive jets. This allowed them to obtain a substantial reduction in the background. The 
same idea was used later for electroweak-boson reconstruction in the context of a SUSY 
search |185] . The tool associated with this technique is often referred to as "Y-splitter" . 

It is worthwhile looking at some simple analytic results that relate to the techniques 
of [184] and |183] . For a quasi-collinear splitting into two objects i and j, the total mass 
is ~ ptiPtjARfj. Labelling i and j such that ptj < Pti and defining z = Ptj/pt [Pt = 




(56) 
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Pti+Ptj), then 



^ z{l - z)piARfj , (57) 

d,, = zY.ARl ^ jT^m' . (58) 
[l-z) 

Electroweak bosons decay with a fairly uniform distribution in z (exactly uniform for a 
Higgs boson), whereas a QCD splitting has a soft divergence, e.g. 

P„«i±(lzi)!. (50) 

This means that for a fixed mass window, the background will have lower dij values than 
the signal. Indeed, the logarithm in eq. comes from the integral over the 1/z divergence 
in eq. (1591) . with lower limit z > /plR^. If one places a cut on d^j, or analogously on 
2, then one eliminates that logarithm, thus reducing the QCD background (one can even 
calculate, analytically, what the optimal cut is for given signals and backgrounds). 

A second set of observations concerns mass resolution. Firstly, with a small cone of size 
R <C ARij used to reconstruct the two prongs of a colour-singlet qq state, then there will 
be an average loss of (squared) mass, and correspondingly of mass resolution, dominated 
by a contribution from perturbative gluon radiation, 

(W) - 2m' ■ ^ fin -|- + O {I)] , R < Ai?^,- , (60) 

TT \ I\Rij J 



with ~ Cf as given in eq. f l30|) . If instead a single jet is used to reconstruct the whole 
qq system, then one can show that most of the perturbative radiation from the qq system 
will be contained in the jet. However there may then be significant contamination from 
the UE and pileup, 

((5m^) ~ ppt-^, (61) 

for a circular jet (cf. eq. dH]), with p = Aue/^ti), with an additional contribution coming 
also from perturbative radiation from the beam. Even though the above two equations 
represent major oversimplifications of the full dynamics, one can understand the task of 
optimising mass resolution as one of minimising both types of contribution (in analogy 
with section IS.l.ip . 

This understanding provided the backdrop to a two-pronged subjet technique given 
in [85], used there for a high-pf Higgs boson search in association with a back-to-back 
high-pf vector boson. The approach involved the Cambridge/ Aachen algorithm, because 
its sequential recombination in increasing angular distance is ideally suited to dealing with 
problems that involve multiple or unknown angular scales. The basic procedure that was 
used to identify a H ^ bb decay went as follows: 

1. Break a C/A jet j into two subjets by undoing its last stage of clustering. Label the 
two subjets ji, j2 such that m^^ > nij^. 
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Jet definition crs/fb as/fb fb 



C/A, i? = 1.2, MD-F 0.57 0.51 

kt, R = 1.0, y,ut 0.19 0.74 

SISCone, R = 0.8 0.49 1.33 

anti-fci, i? = 0.8 0.22 1.06 



0.80 
0.22 
0.42 
0.21 



Table 8: Cross section for signai (as) and tfie Z+jets background {(Tb) in the leptonic Z channel 
of HZ production at a 14 TeV LHC, for 200 < prz/GeV < 600 and 110 < mj/GeV < 125, with 
perfect 6-tagging; the C/A algorithm uses the procedure outhned in the text; the kt algorithm 
uses the first step of decomposition to identify two subjets with a cut on Uij as for C/A; SISCone 
and anti-Zct do not use any subjet analysis, but each require two 6-tags within the jet. In each 
case R has been chosen to give near optimal significance with that algorithm. 

2. If there was a significant mass drop (MD), m^^ < firrij, and the splitting is not too 

asymmetric, y = ^i,' AR'^^ j^ > ?/cut5 then deem j to be the heavy-particle 

neighbourhood and exit the loop (u was taken to be 0.67 and ?/cut = 0.09). Note that 

3. Otherwise redefine j to be equal to ji and go back to step 1. 

The search for a mass-drop, step 2, served to identify the point in the decomposition that 
involved significant hard substructure and, in the context of a Higgs-boson search, one can 
verify that the two subjets at that stage both have a 6-tag. The cut on ?/ ~ ^/(l — z) 
allows one to kill the logarithm for (fake 6-tag) QCD backgrounds in eq. fl56|) . By virtue 
of angular ordering [79], the two C/A subjets produced at that stage, each with opening 
angle equal to ARj^^j^, should contain nearly all the perturbative radiation from the bb 
system (i.e. eq. fl60l) is close to zero). They still tend to include too much contamination 
from the UE however, so one can then apply a filtering technique in which the two subjets 
are reexamined on a smaller angular scale Rfm and only the three hardest components 
(i.e. bbg) were retained. This essentially reduces the coefficient of the UE contamination 
in eq. (16T|) . The value used for Rait was specific to the jet, -Rmt = min(0.3, Ri,i/2), though 
this could perhaps be further optimised. 

A comparison of different jet algorithms for the ZH search channel for rriH = 115 GeV, 
with Z — e^e~ for PtH,Ptz > 200 GeV (and other cuts detailed in [85]) is shown in 
table [HI The C/A algorithm with the mass-drop and filtering (MD-F) is clearly the best 
both at extracting the signal and limiting the background. The kt algorithm fares poorly 
mainly because of its poor mass resolution (its larger area and fluctuations, cf . section 14.41 
make it intrinsically worse than the C/A algorithm, and it is shown without any filtering) . 
SISCone does quite well on the reconstruction of the signal, mainly because of its partic- 
ularly low sensitivity to UE contamination, but does poorly on the background rejection 

^^This j/cut is related to, but not the same as, that used to calculate the splitting scale in [1841 1185] . 
which use a dimensionful c?cut- 
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because it fails to correlate the b tagging with the subjet momentum structure, as does 
anti-fc^. It is probably fair to say that the defects of the algorithms could to some extent be 
resolved with refinements such as the use of jet finding with multiple R values. However 
it is only in the C/A algorithm that the use of multiple R values fits in naturally within 
the context of a single run of the jet finder, and the C/A algorithm provides an internal 
representation of the jet structure that makes it particularly easy to establish the right R 
values. 

For completeness, fig. [271 shows the results of the Monte Carlo simulation (particle-level) 
of the boosted pp — )■ HV Higgs-boson search, illustrating how this becomes a relevant 
search channel at the LHC (and one that provides clean access to the product of Hbb and 
ZZH, WWH couplings). Subsequent to the writing of the original version of this review, 
ATLAS confirmed that results similar to those of [85] are obtained when accounting for 
detector effects |186j . Related methods were also examined for a tiH search [187] and for 
Higgs-boson production in new physics events |188] . 

5.3.2 Three-pronged decays 



motivated in part because high-mass tt resonances are a feature of many new physics 
scenarios (see for example I178[ \19'2\ 1193] and references therein). 

The use of subjet structure in identifying hadronically decaying tops is a much more 
recent topics than for EW bosons, having developed mainly in the last two years. However 
many of the ideas are directly inspired from the two-pronged case. Aside from examining 
the jet-mass (whose distribution is calculated in detail at leading order in |133] ). techniques 
that have been investigated include subjet-decomposition with the kt algorithm [1911 [195], 
C/A subjet techniques |173] and pruning [881 EH]. Among the discriminating variables 
that are used, there are dij type variables |1951I196] . ^r-type variables [19^ I173[ [55 ] [59 ] 1187] 
and event-shape variables |194[ 1133] (in both spherocity-like variable in the plane 

transverse to the jet), constraints on a VT-subjet mass |194[ 1173] as well as other interjet 
correlation variables (a helicity angle 9h in |173] ). Most of the work has been geared to quite 
high-pt boosted tops in the simple environment of a resonance decaying to tbt, though one 
study has also been carried out of moderately boosted top-tagging in the busy environment 
oipp^tiH [T57] . 

A summary of results for top-tagging efficiencies and fake rates in the various high- 
Pt methods is given in table [Hi What emerges from this is that the C/A-based approach 
of [173] seems to offer good efficiencies and very good QCD-jet rejection, with the best signal 
significance and signal-to-background ratio ( "better than 6-tagging at high pt")- Compared 
to the C/A-based method of |85j for Higgs decays, particularly relevant differences are 
that it avoids any reference to mass-drops (thus simplifying the method), and introduces a 

■^^Though there has also recently been work on the three-pronged hadronic decay of a neutralino in an 
i?-parity violating SUSY scenario |189j . 



Three-pronged decays have been studied mainly in the context 
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Method 


efficiency 


fake fraction 


(from \m\) 


just jet mass 


50% 


10% 


ATLAS |195l 1196] 


3,4 kt subjets, dcut 


45% (85%) 


5% (15%) 


Thaler & Waner |194] 


2,3 kt subjets, Zcut + shape 


40% 


5% 


Kaplan et al. jl73j 


3,4 C/A subjets, z^ut + Oh 


40% 


1% 


Ellis et al. [SSI ESj 


Pruning 


15% 


0.05% 


CMS [TW] 


variant of |173j 


45% 


3% 



Table 9: Efficiencies for reconstructing top quarks with pt ^ 1 TeV and the fraction of normal 
QCD jets that get a fake "top-tag". Sliown for the various tagging metliods that quoted numbers 
easily amenable to interpretation in this manner (numbers in brackets are for alternative sets of 
cuts). Insofar as results involve different detector and resolution assumptions, Monte Carlo gen- 
erator choices, as well as slightly different pt cuts, the comparison should be considered indicative 
rather than precise. Furthermore the results of the various methods have different dependences 
on the transverse momenta being studied. 
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M (GeV) M (GeV) 



Figure 23: The optimal value for i? as a function of the mass of the qq/gg system (left/right) , 
as determined from the quality measure for various jet algorithms. Note that the exact 

results for the optimal R depend a little on the choice of quality measure, however the observed 
trends do not. 

minimal distance between subjects, which needs to be adjusted with the jet Pf Its signal 
efficiency and fake-tag rates are shown as a function of transverse momentum in fig. 
(left). The decrease in efficiencies at high transverse momenta is simply a consequence of 
the inclusion of finite calorimeter tower sizes (0.1 x 0.1), and could perhaps be alleviated 
experimentally with the use of tracking and electromagnetic calorimetry, both of which 
have finer angular resolution. In tests by CMS [197J of a variant of the method, the 
efficiency instead saturates near 45% at high-pt, while it is the fake rate that progressively 
degrades. 

One important point in top-tagging is that to obtain the best tagging at high transverse 
momenta, one should use an R value that scales as 1/pt, because the top-quark decay is 
mostly contained in a cone of width of order 2 — 3 times m/pf Using a jet-opening angle 
that is much larger than this will lead to considerable degradation in mass resolution, not 
only because of UE contamination (as in the colour-singlet two-body decay case), but also 
because the top quark, a coloured object, itself radiates gluons, which will tend to increase 
the jet mass. The C/A-based approach of |173j is to some extent able to find the right R 
automatically for a given top-decay, and this is part of its strength. 

Finding the top quark is only half the task however: one must also establish its mo- 
mentum. Barely any of the gluons emitted from a fast-moving top quark are contained in 
the small jet used to identify the top — this is the dead-cone phenomenon for radiation 
from a massive quark. To capture them one should instead use a jet with large opening 
angle, as one would for a high-pt light quark ^7], cf. section This is essential if one is 
to obtain good mass resolution on a ti resonance and is summarised in fig. [2H] (right). 

Thus, when studying highly boosted tops, one needs to examine the event on two 
angular scales: quite small R ~ 3mt/pt to tag the top-decay structure, and large i? ~ 1 to 
reconstruct the top-quark momentum before the top started emitting gluons. 
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kt C/A anti-kt SISCone C/A-filt 




Figure 24: For each process (one per row) this plot shows the extra factor in luminosity, pL, 
required in order to obtain the same significance as with the best jet definition, as a function 
of R. The (red) solid line corresponds to the estimate oi pi based on the minimal width QJ^^^ 

1/ f 

while the (blue) dotted line corresponds to that based on the maximal fraction Q^_^ 2h\/M 
footnote [30]). 



76 





0.04 - 


c 


0.03 - 


!q 








Z 

T3 


0.02 - 


z 






0.01 - 



no pileup 



gg 2 TeV, k,, R=1 .0 
Qr=o.i3 = 80 GeV 



pileup 0.05 mb"' pileup 0.25 mb"' 




1900 2000 2100 2200 
dijet mass [GeV] 



1900 2000 2100 2200 
dijet mass [GeV] 



1900 2000 2100 2200 
dijet mass [GeV] 



Figure 25: Invariant mass distributions for the 2 TeV gg process of section 15.1.21 for the kt 
algorithm with R = 1, shown with no pileup (left), low pileup (middle) and high pileup (right), 
without subtraction (upper row) and with pileup subtraction as outlined in [174j (lower row). 
The shaded bands indicate the region used to calculate the QJ=z Quality measure in each case. 
Figure taken from |87] . 



5.4 Summary 

It is probably fair to say that the question of how best to use jets is still in its infancy. 
Nevertheless, some clear results have emerged from the above discussion. 

• There will not be a single "best" jet definition (i.e. R and jet-algorithm) at the LHC 
What's optimal will depend on what one wants to measure. The trade-off will be be- 
tween resolving separate jets (not really discussed here), capturing their perturbative 
radiation and limiting UE contamination, and depends on the momentum scale of 
an event, the number of jets, and so forth. In particular, kinematic reconstructions 
prefer larger R values at high pt and for gluon jets (even R > 1), because of the 
increased importance of capturing perturbative emission from the jets. 

• Monte Carlo studies of dijet resonances confirm this picture. They also indicate that 
among the different algorithms, kt is worst and SISCone and Cambridge/ Aachen 
with filtering are best (cf. fig. [2^ . This is in accord with expectations based on their 
areas, i.e. their sensitivity to the UE, and is most relevant at high scales. Differences 
between algorithms, expressed as the extra luminosity needed to obtain a given (toy) 
signal significance, are at the level of a few tens of percent. 
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Figure 26: a) Scatter plot of the jet transverse momentum ptj versus its area Aj, for an LHC 
dijet event with a pileup of 22 minimum bias interactions (simulated with the default tune of 
Pythia 6.325 |102j ). The line and band are given by pAj it o^J~Aj. b) The ratio ptj/Aj as a 
function of the rapidity, yj, for the same event; the line and band are given by p it cr/y^ (A). 
Taken from |174j . 

Furthermore, it seems that event-dependent choices for the R value can lead to 
additional improvements of a similar order of magnitude. 

• Pileup is a major issue, and significantly degrades kinematic reconstructions, even 
at high momentum scales. One can devise tools to measure the amount of pileup 
event-by-event and to subtract it jet-by-jet. This leads to a noticeable improvement 
in kinematic reconstruction quality, though it does not quite restore it to the level of 
the no-pileup case. 

• At the LHC's highest momentum scales, electroweak-scale particles appear to be 
light. Their decays are coUimated into single jets. Sequential-recombination jet 
algorithms provide a clean way of resolving the consequent substructure, and the 
most flexible seems to be Cambridge/Aachen, again with filtering to reduce UE 
contamination. 

There are many remaining open questions. Among them: how to reconcile the need for 
large R at high pt with the task of resolving complex multi-jet events; how this connects 
with the use of substructure is resolving highly boosted decays; how to calculate the opti- 
mal R analytically, perhaps using the resulting information event-by-event; how to choose 
the parameters of filtering and how this ties in with possible improvements in pileup sub- 
traction; and how all of this works in full physics studies, including realistic backgrounds 
and detector effects. It is to be hoped that future work will cast light on these questions. 

6 Conclusions 

This review has covered a range of developments in the practical and theoretical aspects of 
jet finding over the past few years. These are steps on the way to a fully developed science 
of the use of jets, "jetography" . 
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Mass (GeV) Mass (GeV) 

Figure 27: Signal and background for a 115 GeV SM Higgs in the pp — )• VH channels, with 
H bb, simulated using Herwig 6.5 and Jimmy 4.31 (an ATLAS tune), C/A MD-F with R = 1.2 
and pt > 200 GeV, for 30 fb~^. The b tag efficiency is assumed to be 60% and a fake-tag 
probability of 2% is used. The qq sample includes dijets and tt. The vector boson selections are 
(a) two leptons, (b) missing energy and (c) lepton plus missing energy, while (d) shows the sum 
of all three channels (see [85] for details). The errors reflect the statistical uncertainty on the 
simulated samples, and correspond to integrated luminosities > 30 fb~^. 

One important development is that LHC now has access to a range of fast, infrared- 
and collinear-safe algorithms, together with methods that allow any of the algorithms to 
be used in a high-luminosity LHC environment. IRC safety is essential if the LHC is to 
benefit maximally from the huge predictive effort that is ongoing within the QCD theory 
community. Practicality is a necessary condition for the algorithms to be used in an 
experimental context. 

A number of these advances have been taken up by the LHC experiments. For example 
both ATLAS and CMS incorporate Fast Jet within their software frameworks. At the time 
of writing (v2 of the arXiv version of this report), all 4 LHC experiments have seen first 
collisions at a centre-of-mass energy of ^/s = 900 GeV and it appears that both ATLAS 
and CMS have used the anti-fct algorithm for finding jets in this initial data. These are 
welcome developments given the importance of IRC safety for straightforward comparisons 
with perturbative QCD predictions and for the use of perturbative methods in generally 
thinking about jets. 

The second main development is that theoretical work has started on the question 
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Figure 28: Left: signal efficiency for boosted top ID, et, and fake-tag rates for quark and gluon 
jets {eq,€g, both multiplied by 10 for visibility) for the Kaplan et al. C/A-based top-tagger, as a 
function of jet pt (reproduced from [173]). Right: the use of two jet sizes for top reconstruction: 
the inner cone, of order a few times m/pt, includes the top decay products, but excludes radiation 
from the top quark itself (dead-cone). To capture that radiation and reconstruct the correct top 
Pt, one should use the outer cone. 



of how best to use jets in an LHC-type environment. This is an important question 
because the LHC spans two orders of magnitude in jet energy and has substantial (and 
variable) pileup, and no single jet definition will work optimally for the whole range of 
LHC phenomena. 

Progress has been outlined here (section H]) on our analytical understanding of how jets 
behave, and in section O we have seen a handful of examples that benefited significantly 
from the use of the "right" jet-finding approach. Currently these two aspects of work on jets 
are connected qualitatively: the understanding of section S] helped to interpret the results 
and inspire some of the methods of section [51 However a rigorous, quantitative link is still 
missing, and section [5] in any case covered only a small fraction of the possible use-cases 
for jets. This highlights a clear path for future work: that of bringing our analytical tools 
to bear on the full range of uses of jets at the LHC, so as to identify optimal jet-finding 
solutions across the board. 
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